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Pol a t:6d Application 
4 This application is^a continuation-in-part of USSN 

10 07/897,788, entitled -cloning, Sequencing and 

Characterization of Two Cell Death Genes and Uses 
Therefor" by H. Robert Horvitz, Junying Yuan, and Shai 
Shaham, filed June 12, 1992. The teachings of USSN 
07/897,788 are incorporated by reference. 

15 Background 

Cell death is a fundamental aspect of animal 
development. Many cells die during the normal develop- 
ment of both vertebrates (Glucksmann, Biol. Rev. 
Carriage Pnilos . See. 26:59-86 (1951)) and inverte- 
brates (Truman, Ann . Rev. Neurosci. 7:171-188 (1984)). 
These deaths -appear to function in morphogenesis, 
metamorphosis and tissue homeostasis, as well as m the 
generation of neuronal specificity and sexual 
dimorphism (reviewed by Ellis et al., Ann. Rev. Cell 
Biol 7:663-698 (1991)). An understanding of the 
mechanisms that cause cells to die and that specify 
which cells are to live and which cells are to die is 
essential for an understanding of animal development. 

The nematode Caenorhabditis elegans is an 
appropriate organism for analyzing naturally-occurring 
or programmed cell death (Horvitz et al., Neurosci. 
Comment . 1:56-65 (1982)). The generation of the 959 
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polypeptides, . and to antibodies directed against 
encoded polypeptides. Methods and probes for 
identifying and screening for other cell death genes, 
including those of vertebrates as well as 

5 invertebrates, and possibly, microbes and 

described. Agents which mimic or affect the activ y 
of cell death genes and methods for identi yin 
agents are also described. Bioassays which detect the 
activity of cell death genes and which are 

L0 identifying cell death genes, for testing the effect 
lotions In cell death genes, and for identi ying 
agents which mimic or affect the activity of cell 
genes are also provided. This invention further 

, , j xr__ altering (increasing or 
relates to methods for altering v 

15 decreasing, the activity of the cell death 

their encoded products in cells and, thus for alter 
the proliferative capacity or longevity of a 
population or organism. 

specifically, the ced-3 and «d-« genes of the 

required for almost all the programmed cell deaths 
which occur during development in C. eleaans Thus 
two cell death genes and their encoded products <KHA. 
25 ^peptide, are now -liable for a var ety o ^se . 
As described herein, the ced-3 and ced g 
be US ed to identify structurally related genes from a 
variety of sources. Some of these related genes are 
UK ely to also function as cell death genes. 

. « „,,ted cell death genes, as 

30 structural comparison^ r lated ^ 

well as mutational analysis, can p 
functionally important regions or features of cell 
aeath'genes and gene products. This information is 
death gen , a ents which mimic or which 

useful in the design of agents wi 
35 alter the activity of cell death genes. 
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This invention further provides methods and agents 
for altering (increasing or decreasing) the occurrence 
of cell death in a cell population or organism. 
Methods and agents, described herein, which decrease 
5 cell death are potentially useful for treatment 
(therapeutic and preventive) of disorders and 
conditions characterized by cell deaths, including 
myocardial infarction, stroke, traumatic brain injury, 
degenerative diseases (e.g., Huntington's disease, 

10 amyotrophic lateral sclerosis, Alzheimer's disease, 
Parkinson's disease, and Duchenne's muscular 
dystrophy) , viral and other types of pathogenic 
infection (e.g., human immunodeficiency virus, HIV), 
aging and hair loss. Methods and agents which 

15 increase cell death are also provided and are 

potentially useful for reducing the proliferation or 
size of cell populations, such as cancerous cells,, 
cells infected with viruses (e.g., HIV) or other 
infectious agents, cells which produce autoreactive 

2 0 antibodies and hair follicle cells. Such methods and 

agents may also be used to incapacitate or kill 
undesired organisms, such as pests, parasites, and 
recombinant organisms . 

Brief Description of the Drawings 
25 Figure 1 shows the genomic organization and 

nucleotide sequence (Seq. ID #1) of ced-4 and deduced 
amino acid sequence (Seq. ID #2) . The genomic sequence 
of the ced-4 region was obtained from plasmid C10D8-5, 
which rescues the ced-4 mutant phenotype. Two likely 

3 0 transcriptional start sites are marked with downward 

arrows., The start of the cDNA is marked with a solid 
arrowhead. The positions of eight ced-4 mutations are 
indicated by upward arrows. Numbers on the sides 
indicate nucleotide positions, beginning at the start 
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of C10D8-S. Numbers under the amino acid sequence 
indicate codon positions. Vertical lines between 
nucleotides indicate splice junctions. 

Figure 2 shows the genomic structure of the ced 
gene and positions of ced-4 stations. The sizes of 
exons and introns are indicated in base pairs (bpW 
The downward arrows indicate the positions of the Tc 
insertion in the »<«-« ,«!«*> and of eight EMS 

induced stations of ced-, . The arrow pointing . : ght 
indicates the direction of transcription The s id 
arrowhead indicates the translation 
The open arrowhead indicates the ochre terminate 

QJ^'w ' shows the sequence similarities between 
^hfced^protein and some calcium-binding proteins 
The consensus sequence of the calcium-bindin, loop is 

shown at ,he top. The positions *' 

a T/PT-hices of an octahedron. 
-X and -Z correspond to vertices or 

'v. fho v Y Z ~X and -Z correspond to 
The numbers above the X, Y, &i A 

•JL^ of the residues within the 29 ammo acid 
0 the positions of the r indic ated by the 

EF-hand sequence. Amino acias 

single letter code. 0, amino acid with an oxygen- 
containinl side chain. non-conserved amino acid. 
Positions V Z and -X can be any amino acid with 
25 oxygen-conLining side chains. Position X x. usually 
aspartic af id, and position -Z is usually ,l»t» c 

l * *™ino acids are shown m bold-face, 
acid Conserved ammo aciab ai 

l^oTn the EF-hand consensus sequence are 
Deviations! from tne 

underlined! Tn 

Figure 4 shows the nucleotide sequence (Seq. 
,»> of ced-3 and deduced amino acid sequence (Seq. 
, 19 The genomic sequence of the ced-3 reg.on was 

i nfi07 which rescues the ced-3 

obtained from plasmid pJ107, wni 

^ ™ a The likely translation initiation 
mutant phenotype. The liKeiy SD lice 
. hv a solid arrowhead. The SL1 spnc« 

35 site is indicated by a soiia 
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acceptor of the RNA is boxed. The positions of 12 
ced-3 mutations are indicated. Repetitive elements in 
the introns are indicated as arrows above the relevant 
sequence. Numbers on the sides indicate nucleotide 
positions, beginning with the start of pJ107. Numbers 
under the amino acid sequence indicate codon positions. 

Figure 5A shows the genomic structure of the ced-3 
gene and the location of the mutations. The sizes of 
the introns and exons are given in bp. The downward 
arrows indicate the positions of 12 EMS-induced 
mutations of ced-3. The arrow pointing right indicates 
the direction of transcription. The solid arrowhead 
indicates the translation initiation site. The open 
arrowhead indicates the termination codon. 
15 Figure 5B shows the locations of the mutations 

relative to the exons (numbered 1-8) and the encoded 

serine-rich region. ' 

Figure 6 is a Kyte-Doo little hydrophobicity plot 

of the Ced-3 protein. 

Figure] 7 shows a comparison of the Ced-3 proteins 
of C. elegans (line 1) and related nematodes, C. 
briggsae (line 2) and C. vulgaris (line 3). The 
conserved anino acids are indicated by " . " . Gaps 
inserted in the sequence for the purpose of alignment 

25 are indicated by "_" . 

Figure 8 shows a restriction site map of the ced-4 
region and the relative positions of plasmid C10D8-5, 
plasmid insert pnl416, and three transcripts encoded by 
the region. 

3 0 Figure 9 shows physical and genetic maps of the 

ced-3 region on chromosome IV. 

Figure 10 summarizes experiments to localize ced-3 
within C48D1. Restriction sites of plasmid C48D1 and 
subclone plasmids are shown, ced-3 activity was scored 

3 5 as the number of cell corpses in the head of LI young 
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animals. ++, the number of cell corpses above 10. +, 
the number of cell corpses below 10 but above 2. 
the number of cell corpses below 2. 

pg-t-ailed Descr *p-H"n of * hp Tnvention 
5 The ced-3 and ced-4 genes of C. elegans have been 

shown to be required for almost all programmed cell 
deaths in C. elegans development (Ellis and Horvitz , 
Cell 44:817-829 (1986)). The present work describes 
the cloning, sequencing and characterization of these 
10 genes. As a result of this work, two genes whose 
activities are required for cell death, referred to 
herein as cell death genes, and their encoded products 
(RNA, polypeptide) are available for a variety of uses. 
Described below are the cloning and characterization of 
15 the C. elegans ced-4 and ced-3 genes, methods and 
probes for identifying structurally related genes,, 
methods for identifying cell death genes from a variety 
of organisms', methods for identifying agents which 
mimic or which affect the activity of cell death genes, 
20 and methods and agents for altering cell death activity 
and thus,' for altering the occurrence of cell death m 
a cell population or organism. 

The activity of a cell death gene is intended to 
include the activity of the gene itself and of the 
25 encoded products of the gene. Thus, agents and 

mutations which affect the activity of a gene include 
those which affect the expression as well as the 
function of the encoded RNA and protein. The agents 
may interact with the gene or with the RNA or protein 
3 0 encoded by the gene, or may exert their effect more 
indirectly. 
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jhe r*d-4 Gene characterization of 

The cloning, .e^encxn, =^a= te ^ 

the C. elegans £> ™ f rom a ce d -. mutant allele- 

Genomic clones were obtain.* fro containing 

g enerated by transposon tagging A ^clo ^ ^ 

as little as 4.4 kb of wild-type g.no 

to complement the ced-4 mutant phenotype (see 

tables are located at the end of the Detailed 

transcript Tbe ^ ^ 'that 

normal levels in a ce<3 

is not a transcriptional regul t r of c , _ 
egression. -^"^^L^ embryogenesis . 

shown to be ^ S t S ^t h ""e bservation that X» of the 
This is consistent with th embryonic 

131 programmed cell deaths in C. ; 

(su lston and Horvit, Dev. « ; 

Sulston et al., ^ . »^ and sequenced . 

cdha : rr. * indicates 

Analysis of the cDNA an ^.^ 

that the putative about 62 ,s77 in 

length (Figure l; Seg. ID W ^ highly 

relative molecular mass. The ce P ^ 
5 nydrophiliC with a predicted pi ^ 

odious "-smembrane region ^ Tfc. ^ ^ 
region is a segment of 12 ammo 

t0 393 * , • o-f the ced-4 genomic clone and 

seguence ana V- of - = ^ ^ 

30 comparison with t h ^ , ^ ^ ^ ^ 

ced-4 gene contains / 

,4 bp to 557 bp (Figure 2) . ms . indU ced 

The nucleotide sequences of eight tn 
^mutations were also determined. Of the eight 
„ Stations, one results in a single amino acid 
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substitution and the other seven appear to prevent 
either ced-4 RNA splicing or completion of Ced-4 
protein synthesis (Figure 2 and Table 2). These seven 
mutations establish the null phenotype of the ced-4 
gene, confirming that ced-4 function is not essential 

for viability. 

Two regions of the inferred Ced-4 protein have 
sequence similarity to known calcium-binding domains 
(Kretsinger, Cold Spring Harbor Symp. Quant. Biol. 
52:499-510 (1987)), suggesting that Ced-4 activity and 
hence, programmed cell death may be modulated by 
calcium (see Figure 3 and Example 1) . Calcium has been 
implicated as an essential mediator of cell death m 
other organisms under a variety of conditions. For 
15 example, extracellular calcium is required for 

glucocorticoid-induced thymocyte death (Cohen and Duke, 
j. Immunol. 232:38-42 (1984)), fcr the deaths of adult 
rat hepatocytes induced by certain toxins in vitro 
(Schanne et al . , Science 206:700-702 (1979)), for 
agonist-induced muscle degeneration in mice (Leonard 
and Salpeter, J. Cell Biol. 82:811-819 (1979)) and for 
neuronal cell death caused by oxygen deprivation or 
excitotoxicity (Coyle et al., Neurosci. Res. Prog. 
Bull. 29:331-427 (1981); Choi, J • Neurosci . 7:369-379 
(1.987), Choi, Trends Neurosci. 12:465-469 (1988)). It 
is possible that programmed cell death is initiated 
during C. elegans development by an increase in 
intracellular calcium, which activates the Ced-4 
protein to become cytotoxic. On the other hand, 
certain cells seem to be protected against cell death 
by calcium (e.g., Koike et al., Proc. Natl. Acad. Sci . 
USA 85:6421-6425 (1989); Collins et al . , J. Neurosci . 
12:25^-2587 (1991)), suggesting that increases in 
intracellular calcium levels may inhibit the activity 
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of the Ced-4 Protein and thereby prevent programmed 

cell death. transC ript in eggs is about 

The level of the cea * „-i ativelY 

a .t that of the actin 1 transcript, whxch x. r latxve y. 
abundant (Edwards and Wood. ,ev. Bid. «:»5-»o 
(1M 3)). This level seems higher than U «ht be 
Reeled if ~,-4 were expressed only in ^V-, cells, 
s fnce in an embryo there are usually no more than 
or three cells dying at the same time 
considerations suggest that m.ght be transcr 

not only in dying cells but in other cells as well 
Perhaps Ced-4 activity, at least during -*ryonxc 
development, is regulated at a P-^^ 
level. ,or example, the Ced-4 protean m ght h ^ ^ 
interact with other protons or other factor 
calcium, to cause cell death. Since the ced 3 g 
a l so essential for programmed cell death in C.J^ 9 
one possibility is that the activity of the Ced 
protein is dependent upon ced-3 function. 

The ^g»r7-3 Gene . , . _ ~f 

SL -^^I- g, sequencing and <*.r.=t«rx^o„^f 

th e ced-3 gene are ™ ^ „ ^gment 

gene was cloned by mapping DNA restn A 
, r*FLPs) and chromosome walking. 

lei, mutant phenotypes. und to 

. . ^ ♦ j The ced-3 transcript 
further identified. The ^ 

be most abundant in embryos, bur w t 

H„its suqgesting that ced-3 is not 
xo larvae and young adults, sugge ^ 

, in cells undergoing programmed cell 
only expressed m ceiis ui 

dea ^* „ f n <-hP ced-3 mRNA was 

A 2 . 5 kb cDNA corresponding 

fenced. The genomic » ^ _ with the cDNA 

3 5 (Figure 4; Seq. ID #18) ana a 




sequence revealed that the ced-3 gene has 8 introns 
which range in size from 54 to 1195 bp (Figure 5A) . 
The four largest introns as well as sequences 5 ' of the 
start codon contain repetitive elements, some of which - 
5 have been previously characterized in non-coding 

regions of other C. elegans genes such as fem-1 (Spence 
et al., Cell 60:981-990 (1990)), lin-12 (J. Yochem, 
personnal communication), and myoD (Krause et al . , Cell 
53:907-919 (1990)). The transcriptional start site was 

10 also mapped, and the ced-3 transcript was found to be 
trans-spliced to a C. eJegans splice leader, SL1. 

Twelve EMS-induced ced-3 alleles were also 
sequenced. Eight of the mutations are missense 
mutations, two are nonsense mutations, and two are 

15 putative splicing mutations (Table 3) . The molecular 
nature of these mutations, together with results of 
genetic and developmental analyses of nematodes 
homozygous for these mutations, indicate that, like 
ced-4, ced-3 function is not essential to viability. 

20 In addition, 10 out of the 12 mutations are clustered 
in the C-terminal region of the gene (Figure 5B) , 
suggesting that this portion of the encoded protein may 
be important for activity. 

The ced-3 gene encodes a putative protein of 503 

25 amino acids (Figure 4; Seq. ID #19). The protein is 
very hydrophilic and no significantly hydrophobic 
region can be found that might be a transmembrane 
domain (Figure 6) . One region of the ced-3 protein is 
very rich in serine. Sequence comparison of two 

3 0 additional ced-3 genes from related nematodes, C. 
briggsae and C. vulgaris, suggests that the exact 
sequence in this serine-rich region may not be 
important but that the serine-rich feature is (Figure 
7; Seq. ID #19-21) . This hypothesis is supported by 

3 5 the analysis of ced-3 mutations: none of 12 EMS-induced 
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ced-3 mutations is in the serine-rich region (Figure 
5B) . 

The conservation of the serine-rich feature among 
the ced-3 genes of different nematodes suggests that 
5 the serine-rich region may act in semi-specific 

protein-protein interactions, similar to acid blobs in 
transcription factors and basic residues in nuclear 
localization signals. In all these cases, the exact 
primary sequence is not important. 
10 It is possible that the serine residues in the 

Ced-3 and Ced-4 proteins may be targets for a Ser/Thr 
kinase, and that the activity of these proteins may be 
regulated post-translationally by protein 
phosphorylation. McConkey et al . (J. Immunol., 
15 145:1227-1230 (1990)) have shown that phorbol esters, 
which stimulate protein kinase C, can block the death 
of cultured thymocytes induced by exposure to Ca ++ , 
ionophores or glucocorticoids (Wyllie, Nature 284:555- 
556 (1980); Wyllie et al., J- Path. 142:67-77 (1984)). 
20 It is possible that protein kinase C may inactivate 
certain cell death proteins by phosphorylation, and 
thus, inhibit cell death and promote cell 
proliferation. Several agents that can elevate 
cytosolic cASP levels have been shown to induce 
25 thymocyte death, suggesting that protein kinase A may 
also play a role in mediating thymocyte death. Further 
evidence suggests that abnormal phosphorylation may 
play a role in the pathogenesis of certain cell- 
degenerative diseases. For example, abnormal 
30 phosphorylation of the microtubule-associated protein 
Tau is found in the brains of Alzheimer's disease and 
Down's syndrome patients (Grundke-Iqbal et al., Proc. 
Natl. Acad. Sci. USA 83:4913-4917 (1986); Flament et 
al., Brai* Res. 515:15-19 (1990)). Thus, it is 
possible that phosphorylation may have a role in 
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regulating programmed cell death in C. elegans. This 
is consistent with the fairly high levels of ced-3 and 
ced-4 transcripts which suggest that transcription 
regulation alone may be insufficient to regulate 
5 programmed cell death. 

Structurally and F unctionally Related Genes 

As a result of the work described herein, it is 
possible to identify genes which are structurally 
10 and/or functionally related to ced-3 or ced-4. Such 
genes are expected to be found in a variety of 
organisms, including vertebrates (e.g., mammals and 
particularly humans), invertebrates (e.g., insects), 
microbes (e.g., yeast) and possibly plants. 
15 Structurally related genes refer herein to genes which 
have some structural similarity to the nucleotide 
sequences (genomic or cDNA) of one or both of the ced-3 
or ced-4 genes, or whose encoded proteins have some 
similarity to one or both of the amino acid sequences 
20 of the Ced-3 or Ced-4 proteins. Functionally related 
genes refer to genes which have similar activity to 
that of ced-3 and ced-4 in that they cause cell death. 
Such genes can be identified by their ability to 
complement ced-3 or ced-4 mutations in bioassays, as 
25 described below. 

Previous studies are consistent with the 
hypothesis that genes similar to the C. elegans ced-3 
and ced-4 genes may be involved in the cell deaths that 
occur in both vertebrates and invertebrates. Some 
3 0 vertebrate cell deaths share certain characteristics 
with the programmed cell deaths in C. elegans that are 
controlled by ced-3 and ced-4. For example, up to 14% 
of the. neurons in the chick dorsal root ganglia die 
immediately after their births, before any signs of 
35 differentiation (Carr and Simpson, Dev. Brain Res. 



2:57-162 (1982)), Genes like ced-3 and ced-4 could 
well function in this class of vertebrate cell death. 
In addition, genes related to ced-3 and ced-4 could 
function in many other types of vertebrate cell death 
5 processes, including those involving cells that die 
long after their births and those that die as a result 
of stress (e.g., oxygen deprivation) or disease. 

Genetic mosaic analysis has suggested that the 
ced-3 and ced-4 genes act within cells that undergo 

10 programmed cell death, rather than through cell-cell 
interactions or diffusible factors (Yuan and Horvitz , 
Dev. Biol. 138:33-41 (1990)). Many cell deaths in 
vertebrates seem different in that they appear to be 
controlled, by interactions with target tissues. For 

15 example, it is thought that a deprivation of target- 
derived growth factors is responsible for vertebrate 
neuronal cell deaths (Hamburger and Oppenheim, 
Neurosci. Comment. 1:39-55 (1982)); Thoenen et al . , in: 
Selective Neuronal Death, Wiley, New York, 1987, Vol. 

20 126, pp. 82-85) . However, even this class of cell 

death could involve genes like ced-3 and ced-4, since 
pathways of cell death involving similar genes and 
mechanisms might be triggered in a variety of ways. 
Supporting this idea are several in vitro and in vivo 

25 studies which show that the deaths of vertebrate as 
well as invertebrate cells can be prevented by 
inhibitors of RNA and protein synthesis, suggesting 
that activation of genes is required for these cell 
deaths (Martin et al . , J . Cell Biol. 106:829-844 

30 (1988); Cohen and Duke, J. Immunol . 132:38-42 (1984); 
Oppenheim and Prevette, Neurosci. Abstr . 14:368 (1988); 
Stanisip et al . , Invest. Urol. 16:19-22 (1978); 
Oppenheim et al . , Dev. Biol. 138:104-113 (1990); 
Fahrbach and Truman, in: Selective Neuronal Death, Ciba 

35 Foundation Symposium, 1987, No. 126, pp. 65-81). It is 
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possible that the genes induced in these dying 
vertebrate and invertebrate cells are cell death genes 
similar to the C. elegans genes ced-3 and ced-4 . 

Also supporting the hypothesis that cell death in 
C. elegans is mechanistically similar to cell death in 
vertebrates is the observation that the protein product 
of the C. elegans gene ced-9 is similar in sequence to 
the human protein Bcl-2 . ced-S> has been shown to 
prevent cells from undergoing programmed cell death 
during nematode development by antagonizing the 
activities of ced-3 and ced-4 (Hengartner, et al . , 
Nature 356:494-499 (1992)). The bcl-2 gene has also 
been implicated in protecting cells against cell death. 
It seems likely that the genes and proteins with which 
ced-9 and bcl-2 interact are similar as well. 

Genes which are structurally related to ced-3 or 
ced-4 are likely to also act as cell death genes. •• 
Structurally, related genes can be identified by any 
number of detection methods which utilize a defined 
nucleotide or amino acid sequence or antibodies as 
probes. .For example, nucleic acid (DNA or RNA) con- 
taining all or part of the ced-3 or ced-4 gene can be 
used as hybridization probes or as polymerase chain 
reaction (PCR) primers. Degenerate oligonucleotides 
derived from the amino acid sequence of the Ced-3 or 
Ced-4 proteins can also be used. Nucleic acid probes 
can also be based on the consensus sequences of 
conserved regions of genes or their protein products. 
In addition, antibodies, both polyclonal and 
monoclonal, can be raised against the Ced-3 and/or Ced- 
4 proteins and used as immunoprobes to screen 
expression libraries of genes. 

One strategy for detecting structurally related 
genes in other organisms is to initially probe animals 
which are taxonomically closely related to the source 
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of the probes, for example, probing other worms with a 
ced-3 or ced-4 probe- Closely related species are more 
likely to possess related genes or gene products which 
are detected with the probe than more distantly related- 
5 organisms. Sequences conserved between ced-3 or ced-4 
and these new genes can then be used to identify 
similar genes from less closely related species. 
Furthermore, these new genes provide additional 
sequences with which to probe the molecules of other 

10 animals, some of which may share conserved regions with 
the new genes or gene products but not with ced-3 , 
ced-4, or their gene products. This strategy of using 
structurally related genes in taxonomically closer 
organisms as stepping stones to genes in more distantly 

15 related organisms can be referred to as walking along 
the taxonomic tree. 

Groups of structurally related genes, such as^ 
those obtained by using the above-described strategy, 
can be referred to as gene families. Comparison of 

20 members within a gene family, or their encoded 

products , . may indicate functionally important features 
of the genes or their gene products. Those features 
which are conserved are likely to be significant for 
activity. Such conserved sequences can then be used 

2 5 both to identify new members of the gene family and in 

drug design and screening. For example, as described 
in Example 2, genes similar to ced-3 from two other 
species of nematodes (C. briggsae and C. vulgaris) were 
identified and characterized. Serine-rich regions were 

3 0 found in the polypeptides encoded by all three genes. 

Although the sequence of the serine-rich region was not 
well qppserved, the number of serines was conserved, 
suggesting that the serine-rich feature, but not the 
exact sequence of the serine-rich region, is 
35 significant for function. 




Functionally important regions can also be 
identified by mutagenesis. For example, inactivating 
mutations of ced-3 were found to cluster within a 
region near the COOH-terminus (Figure 5B) , suggesting 
5 that this region is a functionally important domain of 
the Ced-3 protein* Further mutational analyses can be 
carried out on the ced-3 and ced-4 genes; mutants with 
novel properties, as well as other regions important 
for activity, may be discovered. Mutations and other 
10 alterations can be accomplished using known methods, 
such as in vivo and in vitro mutagenesis (see, e,g., 
Ausubel et al . (eds.)/ Current Protocols in Molecular 
Biology, Greene Publishing Associates and Wiley- 
Interscience , New York) . 

15 Bioassavs and Agents Which Affect the A ctivity of Cell 
Death Genes 

This invention further provides bioassays which 
detect the activity of cell death genes. The bioassays 
can be used to identify novel cell death genes, to 

20 identify mutations which affect the activity of cell 
death genes, to identify genes which are functionally 
related to known cell death genes, such as ced-3 or 
ced-4, to identify genes which interact with cell death 
genes, and to identify agents which mimic or affect the 

25 activity of cell death genes (e.g., agonists and 

antagonists) . For example, the bioassays can be used 
to screen expression gene libraries for cell death 
genes from other organisms. 

In this bioassay, genes or agents are introduced 

30 into nematodes to test their effect on cell deaths in 
vivo. Wild-type, mutant, and transgenic nematodes can 
be used as appropriate for the effect being tested. In 
one embodiment of this bioassay, transgenic nematodes 
are produced using a candidate cell death gene, a 
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mutant cell death gene, or genes from an expression 
library, to observe the effect of the transgene on the 
pattern of programmed cell deaths during development of 
the nematode. For example, a gene which is 
5 structurally related to ced-3 can be used to produce a 
transgenic animal from a mutant nematode which 
underexpresses or expresses an inactivated ced-3 gene 
to see if the related gene can complement the ced-3 
mutation and is thus, functionally as well as 
10 structurally related to ced-3. cDNA or genomic 

libraries can be screened for genes having cell death 
activity. Genes which interact with cell death genes 
to enhance or suppress their activity can also be 
identified by this method. 
15 In another embodiment of the bioassay, wild-type, 

mutant, or transgenic nematodes are exposed to or 
administered peptides and other molecules in order -to 
identify agents that mimic, increase, or decrease the 
activity of a cell death gene. For example, wild-type 
2 0 animals can be used to test agents that inactivate or 
antagonize the activity of ced-3 or ced-4 and hence, 
decrease cell deaths, or that activate or enhance ced-3 
or ced-4 activity and increase cell deaths. Mutant 
animals in which ced-3 or ced-4 is inactivated can be 
25 used to identify agents or genes which mimic ced-3 or 
ced-4 in causing cell deaths. Mutant animals in which 
ced-3 or ced-4 is overexpressed or constitutively 
activated can similarly be used to identify agents that 
prevent ced-3 or ced-4 from causing cell death. 
Transgenic animals in which a wild-type or mutant form 
of an exogenous cell death gene causes excess cell 
deaths. due to overexpression or hyperactivity can be 
used to identify agents that inactivate or inhibit the 
activity of the transgene. Similarly, transgenic 
animals in which a wild-type or mutant form of an 
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exogenous cell death gene is underexpressed or inactive 
can be used to identify agents that activate or 
increase its activity. Test molecules can be 
introduced into nematodes by microinjection, diffusion,* 
5 ingestion, shooting with a particle gun, or other 
method. 

Mutated cell death genes with novel properties may 
be identified by the above bioassay. For example, 
constitutively activated or hyperactive cell death 

10 genes may be isolated which may be useful as agents to 
increase cell deaths. Mutations may also produce genes 
which do not cause cell death but which antagonize the 
activity of the wild-type gene. 

Agents can be obtained from traditional sources, 

15 such as extracts (e.g., bacterial, fungal or plant) and 
compound libraries, or by newer methods of rationale 
drug design. Information on functionally important 
regions of the genes or gene products, gained by 
sequence and/or mutational analysis, as described 

20 above, may provide a basis for drug design. The 

activity. of the agents can be verified both by in vivo 
bioassays using nematodes which express various forms 
of ced-3, ced-4, or related genes, as described above, 
and by in vitro systems, in which the genes are 

25 expressed in cultured cells, or in which isolated or 
synthetic gene products are tested directly in 
biochemical experiments. The agents may include all or 
portions of the ced-3, ced-4 , or related genes, mutated 
genes, and all or portions of the gene products (RNA, 

30 including antisense RNA, and protein) , as well as 
nucleic acid or protein derivatives, such as 
oligonucelotides and peptides, peptide and non-peptide 
mimetics, and agonists and antagonists which affect the 
activity or expression of the cell death genes. The 

35 agents can also be portions or derivatives of genes or 
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gene products which are not cell death genes but which 
regulate the expression of, interact with, or otherwise 
affect the function of cell death genes or gene 
products . 

tt^p?; of the Invention 

Using the above-described probes and bioassays, 
the identification and expression of ced-3 , ced-4 or 
related cell death genes in cultured cells, tissues, 
and whole organisms can be studied to gain insights 
into their role in development and pathology in various 
organisms. For example, the detection of abnormalities 
in the sequence, expression, or activity of a cell 
death gene or gene product may provide a useful 
diagnostic for diseases involving cell deaths. 
L5 This invention further provides means of altering 

or controlling the activity of a cell death gene m a 
cell, and, thus, affecting the occurrence of cell 
death. Activity of the cell death gene can be altered 
to either increase or decrease cell deaths m a 
20 population of cells and, thus, affect the proliferative 
capacity or longevity of a cell population, organ, or 

entire organism. 

Agents which act as inactivators or antagonists of 
the activity of ced-3, ced-4, or other cell death genes 
25 can be used to prevent or decrease cell deaths. Such 
agents are useful for treating (i.e., for both 
proventive and therapeutic purposes) disorders and 
conditions characterized by cell deaths, including 
neural and muscular degenerative diseases, stroke, 
30 traumatic brain injury, myocardial infarction, viral 
(e g.V : 'HIV) and other types of pathogenic infections, 
as well as cell death associated with normal aging and 
nair loss. The agent can be delivered to the affected 
ceUs by various methods appropriate for the cells or 
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organs being treated, including gene therapy. For 
example, anti-sense RNA encoded by all or a part of a 
cell death gene which is complementary to the mRNA can 
be delivered to a population of cells by an appropriate * 

5 vector, such as a retroviral or adenoviral vector, or 
an antagonist of cell death activity can be infused 
into a wound area to limit tissue damage. 

Methods and agents which cause or increase cell 
deaths are also useful, for example, for treating 

10 disorders characterized by an abnormally low rate or 

number of cell deaths or by excessive cell growth, such 
as neoplastic and other cancerous growth. Such methods 
and agents are also useful for controlling or 
eliminating cell populations, such as cells infected 

15 with viruses (e.g., HIV) or other infectious agents, 
cells producing autoreactive antibodies, and hair 
follicle cells. In addition, methods and agents which 
increase cell death can be used to kill or incapacitate 
undesired organisms, such as pests, parasites and 

20 genetically engineered organisms. All or portions of 
ced-3, ce.d-4, or related cell death genes, active 
mutant genes, their encoded products, agents which 
mimic the activity of cell death genes, and activators 
and agonists' of cell death genes can be used for this 

2 5 purpose. 

For example, cell death genes can be used to kill 
cells infected with the human immunodeficiency virus 
(HIV), and thus, prevent or limit HIV infection in an 
individual. A recombinant gene can be constructed, in 
30 which a cell death gene is under the control of a viral 
promoter which is specifically activated by a viral 
protein; the recombinant gene is introduced into HIV 
infected cells. HIV-infected cells containing the 
viral activator protein would express the cell death 




gene product and be killed, and uninfected cells would 
be unaffected • 

Alternatively, an antagonist of ced-3 or ced-4 
activity (such as antisense RNA) can be expressed under- 
5 the control of a viral-specific promoter and in this 
way, be used to prevent the cell death associated with 
viral (e.g., HIV) infection. 

In another example, cell death genes can be used 
as suicide genes for biological containment purposes. 

10 Genetic engineering of suicide genes into recombinant 
organisms has been reported in bacteria (Genetic 
Engineering Hews , Nov. 1991, p. 13) : suicide genes 
were engineered to be expressed simultaneously with the 
desired recombinant gene product so that the 

15 recombinant bacteria die upon completion of their task. 
The present invention provides suicide genes which are 
useful in a variety of organisms in addition to , 
bacteria, for example in insects, fungi, and transgenic 
rodents. Suicide genes can be constructed by placing 

2 0 the coding sequence of an exogenous cell death gene or 
an agonist of an endogenous cell death gene of the 
organism in an expression vector suitable for the 
organism. 

In addition, agents which increase cell death are 
25 useful as pesticides (e.g., anthelminthics , 

nematicides) . For example, many nematodes are human, 
animal, or plant parasites. ced-3, ced-4, or other 
nematode cell death genes, their gene products, 
mimetics, and agonists can be used to reduce the 
30 nematode population in an area, as well as to treat 
individuals already infected with the parasite or 
protect individuals from infection. A transgenic plant 
or animal carrying a constitutively activated ced-3 
gene, ced-4 gene, or other cell death gene specific to 




nematodes can be protected from nematode infection in 
this way. 

The subject invention will now be illustrated by 
the following examples, which are not intended to be 
5 limiting in any way. 2 

EXAMPLE 1 

CLONING, SEQUENCING AND CHARACTERIZATION OF 
THE CED-4 GENE 

MATERIALS AND METHODS 

10 General Methods and Strains 

Techniques used for the culturing of C. elegans 
were essentially as described by Brenner (Genetics 
77:71-94 (1974)). All strains were grown at 20°C. DNA 
was prepared from worms grown on Petri dishes , 

15 containing agarose seeded with E. coli strain HB101. 
RNA was prepared from mass cultures grown in liquid. 
Usually, the bacterial pellet from a 2 L overnight 
culture of E. coli HB101 grown in superbroth (12 g 
Bacto-tryptone, 24 g yeast extract, 8 ml 50% glycerol, 

20 900 ml H 2 0; after autoclaving, 100 ml 0.17 M KH 2 HP0 4 

and 0.72 K 2 HP0 4 were added) was resuspended in 500 ml S 
basal medium (Brenner, 1974 supra) , and worms were 
added from one or two 10 cm Petri dishes in which the 
bacterial lawns had just been consumed. Worms were 

25 harvested about 4-5 days later by centrif ugation and 
washed in M9 buffer (Brenner, 1974 supra) . The yield 
was about 5-10 ml of packed worms. 

Nomarski differential interference contrast 
microscopy was used to examine individual cells in 

30 living' nematodes (Sulston and Horvitz, Dev. Biol. 
82:110-156 (1977)). Methods for scoring the Ced 
phenotype of ced-1, ced-4 and ced-1 ; ced-4 double 




mutants have been described by Ellis and Horvitz, (Cell 
44:817-829 (1986)) and by Yuan and Horvitz, (Dev. Biol. 
138:33-41 (1990) ) . 

The wild-type parent of all mutant strains used in . 
5 these experiments was C. elegans variety Bristol strain 
N2 (Brenner, 1974 supra) . The genetic markers used are 
listed below. These markers have been described 
(Brenner, 1974 supra; Hodgkin et al . , in: The Nematode 
Caenorhabditis elegans, Wood and the Community of C. 
10 elegans Researchers (eds.)r Cold Spring Harbor 

Laboratory, New York, 1988, pp. 491-584; Finney et al - , 
Cell 55:757-769 (1933)). The strain TR679 carries the 
mutator mut-2 (r459) (Collins et al . , Nature 328:726-723 
(1987)). The ced-4 alleles n!894, n!920, nl947 , nl948 / 
15 n2247 , and n2273 were characterized in the present 

work. Genetic nomenclature follows the standard system 
for C. elegans (Horvitz et al., Mol . Gen. Genet. , 
275:129-133 (1979)): 

LG I: ced-1 (el 735; , unc-54(r323) 

20 LG III: unc-86 (nl351) , ced-4 (n!162, n!416, 

nl894, n!920, n!947 f n!94B f n2247, 
n2273, nl416 n!712, nl416 nl713) , 
unc-79 (e!068) , dpy-17 (e!64) 
LG IV: '* unc-31(e928) , ced-3 (n717) 
25 LG V: egl-l(n986) , unc-76 (e911) 

Genomic Libraries 

A 4-6 kb size-selected phage library was 
constructed from ced-4 (n!416) DNA as follows. Genomic 
DNA was digested with Hindlll and run on a low-melting 
3 0 agarose gel. DNA migrating within the 4-6 kb size 
range was excised, and the low-melting agarose was 
removed by phenol extraction and precipitation 
(Maniatis et al . , Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory (1983)). These 



DNA fragments were ligated to Hindlll-digested DNA from 
phage A.NM1149 (Murray, Phage Lambda and Molecular 
Cloning, Cold Spring Harbor Laboratory, 1983, pp. 395- 
432) . The product DNA was packaged with packaging 
extract from Promega. This library had a total of 
140,000 plaque-forming units (pfu) , of which 70% were 
recombinants, as estimated from the ratio of pfu on 
bacteria C600hfl and C600. 

The phage genomic library (provided by J . Sulston) 
was prepared by partial digestion of wild-type C. 
elegans genomic DNA with Sau3A and cloning into the 
BamHI site of phage vector A.2001 (Karn et al . , Gene 
32:217-224 (1984)). 

Tc4 Probe 

The Tc4 probe used for cloning the ced-4 gene and 
for Southern blots was Tc4-.n2.352, which contains a ,Tc4 
element isolated from an unc-86 (nl351) mutant strain 
(Finney et al . , Cell 55:757-769 (1988); Yuan et al . , 
Proc. Natl. Acad. Sci. USA 88:3334-3338 (1991)). DNA 
was labelled with 32 P using either the nick-translation 
procedure described by Maniatis et al . (1983 supra) or 
the oligo-labelling procedure described by Feinberg and 
Vogelstein (Anal. Biochem, 132:6-13 (1983)). 

RNA Preparation. Northern Blot and Primer Extension 

Total C. elegans RNA was extracted using guanidine 
isothiocyanate (Kim and Horvitz, Genes & Dev. 4:357-371 
(1990)). Poly(A) + RNA was selected from total RNA by a 
poly (dT) -column (Maniatis et al . , 1983 supra). To 
prepare stage-synchronized animals, eggs were obtained 
from gravid C. elegans adults grown at 20°C in liquid 
culture. A 5 - 10 ml sample of animals was treated 
with 50 ml of NaOCl/NaOH solution (10 ml NaOCl, 1 g 
NaOH, 4 0 ml H 2 0) for about 10 minutes with vortexing 
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until the adults were dissolved. Eggs were centrifuged 
and washed three times with M9 buffer. Isolated eggs 
were allowed to hatch in S basal medium without food 
for 14 hours at 20°C with shaking. LI larvae were 
5 collected by low-speed centrif ugation after growth on 
E. coli HB101 for 2 hours , L2 larvae after 12 hours, L3 
larvae after 24 hours, L4 larvae after 3 6 hours and 
adults after 48 hours. Northern blot analysis using 
DNA probes was performed essentially as described by 
10 Meyer and Casson (Genetics 105:29-44 (1986)), except 
that RNA was transferred from the gel to the Gene 
Screen filter (DuPont, Wilmington, DE) by capillary 
action. 

Quantitation of ced-4 expression during embryonic 

15 development was done by hybridizing two duplicate 

northern blots with ced-4 cDNA clone SK2-2 and with a . 
genomic DNA clone for the actin 1 gene, pW-16-210, , 
which hybridizes to the 3' untranslated region of the 
actin 1 transcript (Krause and Hirsh, in: Molecular 

20 Biology of the Cytoskeleton , Borisy et al . (eds.), Cold 
Spring Harbor Laboratory, 1984, pp. 287-292). The two 
probes were of the same specific activity (4 x 10 3 
counts/minute//ig) . The emission of B particles from 
the ced-4 and actin 1 bands was counted using a B 

25 counter (Betagen, Waltham, MA). The readings were 7.7 
counts/minute for the actin 1 band and 1.4 counts/ 
minute for the ced-4 band. 

The primer extension protocol was that of Sambrook 
et al. (Molecular Cloning: A Laboratory Manual, 2nd 

30 edition, Cold Spring Harbor Laboratory, 1989, pp. 7.79- 
7.83), using the primer ATTGGCGATCCTCTCGA (Seq. ID 
#22) . To define the lengths of the reaction products, 
a sequencing reaction using this primer and C10D8-5 as 
template was run adjacent to the product of the primer 

35 extension reaction in the sequencing gel. 



Direction of Transcription 

The direction of transcription was determined by 
hybridizing northern blots with single-stranded RNA 
probes- The Bluescribe plasmid containing the insert 
pnl416 was linearized by digestion with either BamHI or 
ff±ndIII, which cleaved at one or the other end of the 
insert- The linearized product was transcribed using 
T3 or T7 RNA polymerase, respectively, generating RNA 
from each strand. These RNA products were used to 
probe Northern blots according to a protocol developed 
by Z. Liu and V. Ambros: Filters were prehybridized in 
50% formamide, 50 mM sodium phosphate (pH 6.5), 5 x 
SSC, 8 X Denhardt's, 0-5% SDS, 250 tig /ml salmon sperm 
DNA and then hybridized with probe at 55 °C and washed 
in 4 x SSC, 0.1% SDS at 60°C 3 times for 20 minutes 
each and then in 2 x SSC, 0.1% SDS once at 60°C for 20 
minutes. Northern blot experiments showed that the 
single-stranded RNA probe transcribed by T3 RNA 
polymerase hybridized to the 2.2 kb ced-4 mRNA, while 
the probe made by T7 RNA polymerase did not. This 
result indicates that the direction of the 
transcription is from the BamEI site toward the Hindlll 
site of pnl416. 

Determination of DNA Sequence 

For determining DNA sequences, serial deletions 
were made according to Henikoff (Gene 28:351-359 
(1984)). DNA sequences were determined using Sequenase 
and protocols obtained from US Biochemicals (Cleveland, 
OH) . The ced-4 DNA sequence was confirmed by 
sequencing both strands of cDNA and genomic DNA clones. 

Cloning of the Cosmid Fragment C10D8-5 

The cosmid C10D8 was digested with EcoRl. Two 
EcoRI fragments of 2.2 kb (r5) and 2.4 kb (r7) , both of 
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which hybridized to a mixture of ced-4 cDNA subclones 
SK2-1 and SK2-2, were isolated, r7 , which hybridized 
to SK2-1, which contains the 3' half of ced-4 cDNA 
clone SK2, was cloned into the EcoRI site of plasmid 
5 pBSKII (Stratagene) . The EcoRI site at the 3' end of 
r7 was deleted by digesting with Styl, which cut once 
at 0.2 kb from the 3' end of the insert, and Sail, 
which cut once in the poly linker, and then religating. 
The deleted r7 plasmid was linearized with EcoRI and 
10 ligated with EcoRI -digested r5, which hybridized to 
Sk2-2, the 5' half of ced-4 cDNA SK2 . Clones were 
analyzed for the correct orientation of the r5 insert 
based on the cDNA restriction map. One such correctly 
oriented clone was named C10D8-5. 

15 Microinjection and Transformation 



that of Fire (EMBO J. 5:2673-2680 (1986)) with 
modifications introduced by J- Sulston. Cosmid DNA to 

20 be injected was purified twice using CsCl-gradient 

centrifugation (Maniatis et al . , 1983 supra). Plasmid 
DNA to be injected was prepared by alkaline minipreps 
(Maniatis et'al., 1983 supra). DNA was treated with 
RNAase A (37°C, 30 minutes) and then with proteinase K 

25 (55°C, 30 minutes), extracted with phenol and then 

chloroform, precipitated twice (first in 0.3 M sodium 
acetate and then in 0.1 M potassium acetate, pH 7.2), 
and resuspended in 5 ul of injection buffer (Fire, 1986 
supra). DNA concentrations used for injection were 

30 0.1-1.0 mg/ml. 



The procedure for microinjecting DNA into the. 
gonad to obtain germline transf ormants was basically 



All transformation experiments used a ced-1; 
ced-4 (nll62) ; unc-31 strain as the recipient. The 
expression of the Ced-4 phenotype was quantified by 
counting the number of cell corpses in the heads of 
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young LI animals. The cosmid C10D8 or plasmid 
subclones of C10D8 were mixed with cosmid C14G10, which 
contains the wild-type unc-31(+) gene, at a ratio of 
2:1 or 3:1 to increase the likelihood that a 
5 phenotypically non-Unc transformant would contain the 
cosmid or plasmid being tested. Generally, 20-30 
animals were injected in one experiment. Non-Unc Fl 
progeny of injected animals were isolated three to four 
days later. About 1/2 to 1/3 of the non-Unc progeny 
10 transmitted the non-Unc phenotype to their progeny and 
could be established as lines of transf ormants . Young 
LI non-Unc progeny of such non-Unc transf ormants were 
examined using Nomarski optics to determine the number 
of cell corpses present in the heads. 

15 Ced-4 Fusion Protein and Antibody Preparation 

To express a Ced-4 fusion protein in E. coli , ,a 
clone containing both the 5' and 3' halves of the ced-4 
cDNA (SK2-2 and SX2-1) in the expression vector pET-5a 
(Rosenberg et al . , Gene 55:125-135 (1987)) was 

20 constructed. The fusion protein expressed by this 

vector was expected to include 11 amino acids of phage 
T7 gene 10 protein, 5 amino acids of linker and the 546 
amino acids encoded by ced-4 cDNA SK2 . The pJ7 6 
plasmid, which encodes this fusion protein, was 

25 transformed into bacterial strain BL21. ced-4 fusion 
protein was produced by this transformed strain, as 
expected, and subjected to electrophoresis on a 
polyacrylamide gel. A band, with mobility equivalent 
to about 64 x 10 3 Mr, specific to the transformed 

30 strain was exercised and used to immunize three 

rabbits. Sera from all three rabbits tested positive 
on western blots (Towbin et al . , Proc . Natl. Acad. Sci. 
USA 76:4350-4354 (1979)). These sera were purified 
using immunoblots (Harlow and Lane, Antibodies : A 
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Laboratory Manual, Cold Spring Harbor Laboratory, 
1988) . 

RESULTS 

Cloning of the ced-4 Gene bv Transposon Tagging 
5 The ced-4 allele nl416 in the C. elegans strain 

TR679 was isolated, which carries the mutator mut- 
2(r459) and shows an elevated frequency of 
transposition elements (Collins et al . , Nature 328:726- 
728 (1987); Yuan et al . , Proc. Natl. Acad. Sci . USA 

10 88:3334-3338 (1991)). The ced-4 (n!416) mutation is 
closely linked to a newly transposed copy of the C. 
elegans transposon Tc4 (Yuan et al . , 1991 supra). 
Using Tc4 as a probe, this novel Tc4 element and its 
flanking region was cloned as a 5 kb Jfindlll fragment 

15 from a 4-6 kb size-selected ced-4 (n!416) genomic phage 
library. A 3 kb adjacent to this Tc4 element was 
isolated by digesting the 5 kb Hin&IlI fragment with 
BamEI. This 3 kb fragment, called pnl416, was cloned 
into the Bluescribe M13 + plasmid vector (Stratagene) . 

2 0 When used as a probe on Southern blots, pnl416 

hybridized to, a 3.4 kb Hindlll fragment in DNA of wild- 
type (strain N2) and two non-Ced revertants of 
ced-4 (nl416) , ced-4 (n!416 n!712) and ced-4 (nl416 n!713) 
(Yuan and Horvitz, Dev. Biol. 238:33-41 (1990)), and a 
25 5 kb Hindlll fragment in ced-4 (nl416) animals. The 
hybridizing band in ced-4 (n!416) DNA is 1.6 kb larger 
than that of the wild-type or the revertants, 
indicating that an insertion of this size is present in 
the ced-4 (nl416) strain and is deleted in both 

3 0 revertants. These observations indicate that the Tc4 

insertion in ced-4 (nl416) animals is responsible for 
their Ced-4 mutant phenotype and suggest that pnl416 
contains at least part of the ced-4 gene. 
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To isolate additional genomic DNA from the region 
of this Tc4 insertion, pnl4l6 was used to probe a C. 
elegans Bristol N2 genomic DNA phage library. Five 
phage clones with inserts of 10 to 
5 15 kb were isolated and shown to share a 3 kb SamHI- 
tfindlll fragment that hybridized to pnl416. These 
phage clones were used to identify cosmids that 
hybridized to them and that were members of a 600 kb 
contig of overlapping cosmids (Coulson et al . , Proc. 
10 Natl. Acad. Sci . USA 83:7821-7825 (1986)). By using 
the phage clones as probes to hybridize to Southern 
blots, a cosmid C10D8 was identified as containing all 
regions of genomic DNA present in all five phage clones 
and in pnl416. 

15 ThP ced-4 Mutant Ph e nntvne Can Be Rescued by a 4.4 kb 

dna Fragment i 

To identify ced-4(+) DNA capable of complementing 
the Ced-4 mutant phenotype, the cosmid C10D8 was 
injected into the oocytes of ced-4 (nll62) animals. To 

20 facilitate the identification of transgenic animals, a 
mutation in the unc-31 gene, which affects locomotion, 
was included as a marker for co-transformation (Kim and 
Horvitz, Genes & Dev. 4:357-371 (1990)). Cosmid 
C14G10, which contains the wild-type allele of unc-31 

25 and does not have Ced-4 -rescuing activity was 
coinjected with cosmid C10D8 into ced-1 (el735) ; 
unc-31 (e928) ; ced-4 (n!162) animals. The ced-1 mutation 
was included to facilitate the scoring of the ced-4 
mutant phenotype (Ellis and Horvitz, Cell 44:817-829 

30 (1986)). Specifically, when a cell undergoes 

programmed cell death in C. elegans, its corpse is 
quickly engulfed and destroyed by a neighboring cell 
(Robertson and Thomson, J. Embryol . Exp. Morph. 57:89- 
100 (1982); Sulston et al . , Dev. Biol. 100:64-119 
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(1983)). A ced-1 mutation prevents this engulf ment, 
allowing the cell corpse to remain intact (Hedgecock et 
al., Science 220:1277-1280 (1983)). Thus, in a first 
or second stage (LI or L2) ced-1 mutant larva, many 
cell corpses are present and can be easily visualized 
using Normaski optics. ced-4 mutations prevent cell 
death and the appearance of these corpses. Thus, 
suppression of the Ced-4 mutant phenotype by a wild- 
type ced-4 gene can be observed and readily quantified 
in a ced-1 mutant background based on an increase in 
the number of visible cell corpses. 

From one such microinjection experiment, three 
non-Unc animals rescued for the Unc-31 mutant phenotype 
were picked from among the Fl progeny, and from one of 
them a line of non-Unc transf ormants was obtained. No 
true-breeding non-Unc animals could be isolated from 
this line: about 25% of the progeny of all non-Unc, 
animals were. Unc. Since no inviable zygotes were 
observed among the progeny of these non-Unc animals, 
this transformant did not carry a recessive lethal 
insertion mutation. Rather, it seems likely that the 
injected DNA was maintained as an extra chromosomal 
array that was segregated to only some gametes, as has 
been reported previously for many other C. elegans 
transgenic strains (e.g., Stinchcomb et al., Mol. Cell 
Biol. 82:110-156 (1985); Fire, EMBO J. 5:2673-2680 
(1986); Way and Chalfie, Cell 54:5-16 (1988)). This 
putative extrachromosomal array was named nExl. Young 
LI progeny of nExl -containing animals were examined 
3 0 using Nomarski optics for the Ced-4 phenotype. 

Young LI ced-1 animals have an average of 23 cell 
corpses in the head, while ced-1 (ell '35) ; ced-4(nll62) 
animals have an average of 0.6 cell corpses (Ellis and 
Horvitz, Cell 44:817-829 (1936)). Young LI ced-1; 
35 C ed-4(nll62); nExl animals had an average of nine cell 
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corpses in the head. These results indicate that 
cosinid C10D8 restored significant, but not total, 
ced-4(+) activity in the transf ormants . 

To delineate the ced-4 gene within C10D8, various - 
5 subclones of C10D8 were injected into ced-4 mutant 
animals and tested for their ability to rescue the 
Ced-4 mutant phenotype (Table 1) . The smallest 
subclone plasmid that could rescue the ced-4 phenotype 
as effectively as cosmid C10D8 was a 4.4 kb fragment, 

10 called C10D8-5. C10D8-5 and the unc-32 (+) -containing 
cosmid C14G10 were coinjected into ced-1; unc-31; 
ced-4 (nll62) animals. Two lines of non-Unc 
transf ormants were isolated. Since these animals 
continued to segregate Unc animals and did not produce 

15 inviable zygotes, both appeared to carry 

extrachromosomal arrays, which were designated nZx7 and 
nExQ. Young LI animals from these transf ormant strains 
had an average of 11.5 cell corpses in their heads, 
indicating that plasmid C10D8-5 restored ced-4 (+) 

20 activity as well as did cosmid C10D8 (Table 1) . 

Identification of a ced-4 Transcript 

Restriction sites of plasmid C10D8-5 (which can 
rescue the Ced-4 phenotype) and pnl416 (which contains 
sequences adjacent to the Tc4 insertion site) were 
2 5 mapped. C10D805 was found to overlap with 2 kb of 
sequence in pnl416, including the Tc4 insertion site 
(Figure 8) . 

In Northern blot experiments, both pnl416 and 
C10D8-5 were used to probe poly (A) + RNA populations of 
30 mixed developmental stages of wild-type (strain N2), 
ced-4 (nl416) , and ced-4 (nl416 nl712) and ced-4 (nl416 
7}1713;' : 'revertant animals. pnl416 hybridized to a 2.2 
kb transcript and an 0.9 kb transcript in RNA from N2 
animals, and a 3 kb transcript, a transcript slightly 




larger than the wild-type 2.2 kb transcript, and a 
transcript slightly smaller than the wild-type 0.9 kb 
transcript in ced-4 (n!416) animals. The 3.8 kb RNA 
contained Tc4 sequence (see below) , suggesting that 
5 this RNA resulted from the insertion of the 1.6 kb Tc4 
sequence into the ced-4 sequence encoding 2.2 kb 
transcript. The transcript slightly larger than the 
2.2 kb wild-type transcript did not contain Tc4 
sequence. This ced-4 (nl416) RNA might have been an 

10 aberrant transcript containing sequences adjacent to 
the ced-4 gene: when pnl416 was used as a probe, the 
wild-type 2.2 kb and the slightly larger transcript in 
this mutant were relatively similar in intensities, 
whereas when ced-4 cDNA clone SK2-1 was used as a 

15 probe, this mutant transcript was not detected (see 
below) . These observations indicate that the 
ced-4 (nl416) 2 . 2 kb transcript contains sequences from 
the ced-4 region but does not contain sequences 
corresponding to at least the 3' half of the ced-4 

20 mRNA. The two revertants of ced-4 (nl416) , ced-4 (nl416 
nl712) and ced-4 (n!416 nil 13) , contained both 2.2 kb 
and 0.9 kb transcripts with similar sizes to the wild- 
type transcripts. Thus, both the 2.2 kb and the 0.9 kb 
transcripts Were altered in ced-4 (nl416) animals, and 

25 both were restored in the two non-Ced revertants. 

To determine if any of the transcripts contains 
Tc4 sequence, the Northern blots were probed with 
Tc4 -nl35I, which contains the 1.6 kb Tc4 element 
present in the Tc4-induced mutant unc-86 (nl351) as well 

30 as 4 kb of unc-S5 sequences. Tc4-nI352 hybridized both 
to a 3.8 kb transcript of the Tc4-induced mutant 
ced-4 (n!416) and to a 1.5 kb unc-68 transcript in both 
ced-4 (nl416) and N2 animals. 

To determine whether one or both of the 2.2 kb and 

35 0.9 kb transcripts are encoded by ced-4, subclone 



C10D8-5, which rescued the Ced-4 phenotype, was used to 
probe the Northern blots. C10D8-5 detected the wild- 
type 2.2 kb transcript, the ced-4 (nl416) transcript 
slightly larger than the 2.2 kb transcript, and the 

5 ced-4 (nl416) 3 . 8 kb transcript. C10D8-5 did not 
hybridize to the 0.9 kb transcript, indicating that 
this transcript is unlikely to be encoded by ced-4. 
C10D8-5 also detected a 1.4 kb transcript, which was 
not altered by the Tc4 insertion in ced-4 (nl416) 

10 animals. Only a 470 bp EcoRI-StuI fragment at one end 
of C10D8-5 hybridized to this 1.4 kb RNA. Since 
C10D8-5 did not contain the complete coding region for 
this RNA, and since this RNA was unaffected in 
ced-4 (nl416) animals, this 1.4 kb RNA seems unlikely to 

15 be a ced-4 transcript. The relationships among cosmid 
C10D8-5, pnl416 and the 0.9 kb, 1.4 kb and 2.2 kb 
transcripts are summarized in Figure 8. 

On Northern blots probed with the ced-4 cDNA clone 
SK2-1, the level of the 2.2 kb transcript showed 

20 significant reduction in all three independently 

derived EMS-induced ced-4 mutants examined, strongly 
supporting the hypothesis that this 2.2 kb transcript 
is a ced-4 transcript. Total RNA from N2, 
ced-4 (nll62) / ced-4 (nl416) , ced-4 (nl894) and 

25 ced-4 (nl920) eggs was probed with 32 P-labelled ced-4 
cDNA SK2-1. An actin 1 probe (Krause and Hirsh, in: 
Molecular Biology of the Cytoskeleton , Borisy et al . 
(eds.), Cold Spring Harbor Laboratory, 1984, pp. 287- 
292) was used as an internal control for the amount of 

3 0 RNA loaded in each lane. The ratios of the intensity 
of the ced-4 band to that of actin band in N2 , nll62 , 
nl416 and nl894 were 0.5, 0.17, 0 and 0.12, 
respectively. A Northern blot of poly (A) + RNA from 
stage-synchronized animals was probed with pnl416, 

3 5 which hybridizes both to the 2.2 kb ced-4 transcript 
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and to a 0.9 kb transcript. The 0.9 kb transcript 
seems to be expressed mostly in eggs and adults. The 
presence of RNA in all lanes was confirmed by loading 
1/10 of each sample on another gel and probing a 
5 Northern blot from this gel using the C. elegans actin 
1 gene (Krause and Hirsh, 1984 supra) . That all of 
these distinct ced-4 mutations cause reduced levels of 
a ced-4 transcript could reflect either instability of 
all three mutant transcripts or a role for ced-4 in 

10 regulating its own expression. 

Based upon these results, it can be concluded that 
the 2.2 kb RNA is a ced-4 transcript. It is not known 
why the 0.9 kb RNA is also altered in ced-4 (nl416) 
animals. Perhaps transcription of the 0.9 kb RNA is 

15 initiated incorrectly as a consequence of the nearby 
Tc4 element. 

r.*d-4 Expression is p -rimarilv Embryonic 

A Northern blot containing RNAs from stage- 
synchronized animals of different developmental stages 

20 probed with pnl416 showed that the 2.2 kb ced-4 
transcript was expressed primarily during 
embryogenesis. This result is consistent with the 
observation that 113 of the 131 programmed cell deaths 
in the C. elegans hermaphrodite are embryonic (Sulston 

25 and Horvitz, Dev. Biol. 82:110-156 (1977); Sulston et 
al.. Dev. Biol. 200:64-119 (1983)). The 2.2 kb RNA was 
relatively abundant during embryonic development. The 
0.9 kb transcript was expressed mostly in eggs and 
adults. The presence of RNA in all lanes was confirmed 

3 0 by loading 1/10 of each sample on another gel and 
probing a Northern blot from this gel with the C. 
elegans actin 1 gene (Krause and Hirsh, 1984 supra). 
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ThP TranscriDt is Present in a ced-1 Mutant 

The activities of both ced-3 and ced-4 are 
required for programmed cell death (Ellis and Horvitz, 
Cell 44:817-819 (1986)). One possibility is that one 
5 of these genes positively regulates the expression of 
the other. For this reason, a Northern blot of wild- 
type strain N2 and ced-3 (n717) poly (A) + RNA was probed 
with pnl416. This experiment showed that the 2.2 kb 
ced-4 transcript was present at an apparently normal 
10 level in this ced-3 mutant. Thus, the activity of the 
ced-3 gene is unlikely to be necessary for the 
expression of the cad-4 2.2 kb transcript. 

Tripnt.i f ication of ce d-4 cDNA Clones 

To isolate cDNA clones of ced-4, pnl416 was used 
15 to probe a C. elegans cDNA phage library made from 

wild-type strain N2 mixed-stage RNA (Kim and Horvitz, 
Genes & Dev 4:357-371 (1990)). Two cDNA clones were 
isolated. The two cDNA clones (named SKI and SK2) 
hybridized to the 2.2 kb ced-4 transcript. Both are 
20 about 1.8 kb in size, and both contain one 0.8 kb and 
one 1.0 kb EcoRI fragment. These EcoRI fragments were 
subcloned into plasmid vector Bluescribe M13+ 
(Stratagene) The two subclones derived from SKI were 
named SK1-1 and SK1-2, and the two subclones derived 
25 from SK2 were named SK2-1 and SK2-2. The restriction 
maps of the SKI- and SK2-derived clones were the same. 
Sequence analysis of the ends of the four cDNA 
subclones confirmed the equivalence of the SKI and SK2 
clones, except that SK1-2 contains a poly (A) sequence 
30 of more than 50 bp at its 5' end. This poly(A) 

sequence is probably a cDNA cloning artifact, since 
SK1-2' contains the 5' half of the cDNA (see below). 
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yhe ced-4 Sequence 

The DNA sequence of the SK2 1.8 kb cDNA clone was 
determined. This sequence includes an open reading 
frame encoding 546 amino acids (Figure l; Seq. ID #2) , _ 
which is consistent with the results of Northern blot 
analysis using single-stranded RNA probes. An ochre 
termination codon (TAA) is located in-frame near the 3' 
end, indicating that the 3' end at the 2.2 kb 
transcript is most likely included in this cDNA. The 
open reading frame extends to the 5' end of the 1.8 kb 
cDNA, suggesting that this cDNA might lack the 5' end 
of the ced-4 coding region. 

A primer extension experiment was performed to 
determine the ced-4 transcription initiation site(s) 
using the primer ATTGGCGATCCTCTCGA (Seq. ID #23) and 
C10D8-5 as template. A major transcriptional 
initiation site was identified at 54 bp before (5' of) 
the beginning of the ced-4 cDNA SK2 and a minor 
initiation site at 54 bp after (3' of) the beginning of 
this cDNA (Figure 1) . The first AUG codon after the 
presumptive major start site is located at 9 bp before 
the beginning of the cDNA (Figure 1). If this site is 
used to initiate protein synthesis, the Ced-4 protein 
would be 549, -amino acids in length. The first AUG 
codon after the presumptive minor start site is located 
at 13 0 bp after the beginning of the cDNA. If this 
site is used, the Ced-4 protein would be 503 amino 
acids in length. Preliminary results using an anti- 
Ced-4 antibody raised against a Ced-4 fusion protein 
showed that endogenous Ced-4 protein is slightly 
smaller in molecular weight than a Ced-4 fusion protein 
of 562 amino acids expressed in E. coli. Thus, most 
Ced-4' protein is probably initiated near the start of 
the cDNA and is presumably 549 amino acids in length 
and 62,977 in relative molecular mass. The direction 
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of the open reading frame is consistent with the 
direction of transcription, as demonstrated by probing 
Northern blots with single-stranded RNA probes. The 
presumptive Ced-4 protein is highly hydrophilic, with a. 
pi of 5.12. The longest hydrophobic region is a 
segment of 12 amino acids from residues 382 to 393. 

A Western blot of wild-type strain N2 mixed-stage, 
ced-4 (nl416) mixed-stage, wild-type egg, and 
bacterially expressed protein (pJ76) was probed using 
anti-Ced-4 antibody. Ced-4 fusion protein (pJ76) was 
made by cloning ced-4 cDNA SK2 into the T7 expression 
vector pET-5a (Rosenberg et al., Gene 55:125-135 
(1987)), so that 546 amino acids of Ced-4 sequence were 
fused to 11 amino acids of T7 gene 10 protein and 5 
15 amino acids of linker sequence. This Ced-4 fusion 
protein is similar in relative molecular mass to the 
endogenous Ced-4 protein, which is present in wild-type 
(N2) but missing in ced-4 (nl416) animals. The proteins 
phosphorylase b, 97 x 10 3 ; bovine serum albumin, 66 x 
20 10 3 (Hirayama et al., Biochem. Biophys, Res. Comm. 

173:639-646 (1990)); and ovalbumin, 43 x 10 3 , were used 
as molecular weight standards. 

To confirm the DNA sequence obtained from the 
ced-4 cDNAs and to study the structure of the ced-4 
25 gene, the sequences of the 4.4 kb cosmid subclone 

C10D8-5, the 3 kb insert pnl416, and the 2 kb ffindlll- 
BamEl fragment that contains the Tc4 insertion in the 
ced-4 (nl416) mutant were determined. Comparison of the 
ced-4 genomic and cDNA sequences revealed that the 
ced-4 gene has seven introns of sizes ranging from 44 
bp to 557 bp (Figure 2). The exon sequences of genomic 
clone C10D8-5 are identical to the sequences of ced-4 
cDNA Comparison of the Tc4 insertion site in 

ced-4 (nl416) DNA with the ced-4 (+) genomic and cDNA 
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sequences indicated that Tc4 inserted into an exon in 
the ced-4 gene in ced-4 (n!416) animals (Figure 2). 

The DNA sequences of eight EMS-induced ced-4 
alleles were also determined (Table 2). One of the 
5 eight, nl948 , is a missense mutation. Of the seven 

others, four create stop codons and three are predicted 
to affect splicing of the ced-4 transcript. The 
positions of these mutations are indicated in Figure 2. 
These findings indicate that the phenotypes of these 
10 mutants (Ellis and Horvitz, Cell 44:817-829 (1986)) 
result from a complete loss of ced-4 gene function. 
These mutations establish the null phenotype of the 
ced-4 gene, confirming that ced-4 function is not 
essential for viability. 

15 The Ced-4 Protein Has Two Regions Similar to Known 
Calcium-Bindina Domains , 

By direct inspection, the sequence of the putative 
Ced-4 protein was compared with the consensus sequence 
of the calcium-binding loop of the EF-hand domain 

20 (Tufty and Kretsinger, Science 187:161-171 (1975); 
Kretsinger, Cold Spring Harbor Symp. Quant. Biol. 
52:499-510 (1987); Szebenyi and Moffat, J. Biol. Chem. 
26:8761-8777 '(1986) ) . Two regions of the Ced-4 protein 
were identified that might bind calcium (Figure 3). 

25 The EF-hand is a 29 amino acid domain consisting 

of a helix-loop-helix region, with the loop portion 
(residues 10-21) coordinating calcium-binding via the 
side-chain oxygens of serine, threonine, asparagine, 
aspartic acid, glutamine or glutamic acid. These 

30 residues occur at five of the vertices of an 

octahedron: X (position 10), Y (12), Z (14), -X (18), 
-Z (21) . EF-hand amino acid sequences vary 
considerably in the residues present in the calcium- 
binding loop (Figure 3) , and some EF-hand domains have 



only one helical region (Xretsinger, 1987 supra) . The 
consensus sequence is shown at the top of Figure 3. 
Positions Y, Z, and -X can have any of a number of 
amino acids which have oxygen-containing side chains. 
5 Position X is usually aspartic acid, and position -Z is 
usually glutamic acid. 

The sequences of parvalbumins from carp muscle 
(Seq. ID #3; Nockolds et al . , Proc. Natl. Acad. Sci. 
USA 65:581-584 (1972)), the intestinal calcium-binding 
10 protein (ICaBP) (Seq. ID #7-8; Szebenyi et al., Nature 
254:327-332 (1981)), troponin C (Seq. ID #9-12; Collins 
et al., FEBS Lett. 36:268-272 (1973)) and calmodulin 
(Seq. ID #13; Zimmer et al . , J. Biol. Chem. 263:19,370- 
19,383 (1988); Babu et al., Nature 315:37-40 (1985)) 
15 show canonical EF-hands. The hake and ray parvalbumins 
(Seq. ID #4-5; Capony et al . Eur. J. Biochem. 32:97-108 
(1973)); Thatcher and Pechere, Eur. J. Biochem. 75:121- 
132 (1977)), sarcoplasmic calcium-binding protein 
(SCBP) from the protochordate Amphioxus (Seq. ID #6; 
20 Takagi et al., Biochemistry 25: 3585-3592 (1986)), 
trypsinogen (Seq. ID #14; Bode and Schwager, J. Mol . 
Biol. 58:693-717 (1975)), fibrinogen (Seq. ID #15; 
Doolittle, Ann. Rev. Biochem. 53:195-229 (1984); Dang 
et al., J. Biol. Chem. 260:9713-9719 (1985)), villin 
25 (Seq. ID #16; Hesterberg and Weber, J. Biol. Chem. 

258:365-369 (1983)) and galactose-binding protein (GBP) 
(Seq. ID #17; Vyas et al., Nature 327:635-638 (1987)) 
show variations from the consensus sequence. GBP does 
not contain the helices of the EF-hand. 
3 0 The potential calcium-binding loops of sequence 1 

and sequence 2 are located at amino acids 77-8 8 and 
amino acids 292-303 of the Ced-4 protein, respectively 
(Figure''3). In its putative calcium-binding loop, the 
first potential EF-hand-like sequence of the Ced-4 
35 protein has four (positions Y , Z, -X, -Z) of the five 



conserved residues with oxygen-containing side chains 
(shown in bold), and the fifth position (X) has a 
tyrosine rather than an aspartic acid; tyrosine 
contains oxygen in its side chain. The second 

5 potential EF-hand-like sequence of the Ced-4 protein 
has three residues (positions Z, -X, -Z) that match the 
consensus sequence, and amino acids with oxygen- 
containing side chains at the other two positions. 
These observations suggest that these two regions of 

10 the Ced-4 protein might bind calcium. Like the Ced-4 
protein, a number of known calcium-binding proteins, 
such a bovine intestinal calcium-binding protein 
(ICaBP) (Szebenyi and Moffat, 1986 supra) , rabbit 
troponin C (Collins et al . , 1973 supra), trypsinogen 

15 and villin (Doolittle, 1984 supra; Danget et al . , 1985 
supra) have only three or four conserved residues at 
these five positions (Figure 3). The EF-hand domains 
in ICaBP and troponin C have been shown by X-ray 
crystallography to bind calcium. 

20 One major difference between the Ced-4 protein and 

the calcium-binding loop of the EF-hand consensus 
sequence is at position 15. Here, the two Ced-4 
sequences have a histidine and a glutamic acid, 
respectively/ whereas most ET-hand-containing proteins 

25 have a glycine; this glycine has been suggested to be 
important for the turning of the loop (Kretsinger, 1987 
supra) . However, a histidine is present at this 
position in a parvalbumin and an aspartic acid is 
present in another parvalbumin and also in a 

30 sarcoplasmic calcium-binding protein (Kretsinger, 1987 
supra) (Figure 3). Thus, the presence of histidine or 
glutamic acid at position 15 does not rule out the 
possibility that these regions bind calcium. 

The calcium-binding loop (positions 10-21) of the 

35 EF-hand is thought to be preceded (positions 1-9) and 




followed by alpha-helical domains (positions 22-29) 
(Kretsinger, 1987 supra) . Since position 3 of Ced-4 
sequence 1 and positions 26 and 28 of Ced-4 sequence 2 
are prolines, these regions might not form alpha - 
5 helices. However, the known calcium-binding protein 
galactose-binding protein (GBP) has a calcium-binding 
domain similar to that of the EF-hand (Figure 3) but 
without the two helices; furthermore, position 29 of 
GBP is proline (Vyas et al . , 1987 supra). Thus, the 
10 Ced-4 protein need not contain such alpha-helical 
calcium-binding domains. 

Based upon these considerations, it seems likely 
that the Ced-4 protein binds calcium or a similar 
divalent cation. 

15 EXAMPLE 2 

CLONING. SEQUENCING . AND CHARACT ERIZATION OF 
THE CED-3 GENE 

MATERIALS AND METHODS 

General Methods and Strains 

20 The techniques used for the culturing of C. 

elegans were as described by Brenner (Genetics 77:71-94 
(1974))- All strains were grown at 20°C. The wild- 
type parent strains were C. elegans variety Bristol 
strain N2 , Bergerac strain EM1002 (Emmons et al . , Cell 

25 32:55-65 (1983)), C. briggsae and C. vulgaris (obtained 
from V. Ambros) . The genetic markers used are 
described below. These markers have been described by 
Brenner (1974 supra), and Hodgkin et al - (In: The 
Nematode Caenorhabditis elegans, Wood and the Community 

30 of C. elegans Researchers (eds.), Cold Spring Harbor 
Laboratory, 1938, pp 491-584). Genetic nomenclature 
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follows the standard system (Horvitz et al., Mol. Gen. 
Genet. 175:129-133 (1979)). 

LG I: ced-l(el375) ; unc-54 (r323) 
LG VI: unc-31(e928) , unc-30 (el91) , ced-3 {nil! , n71B, 
5 nl040, nll29, nll63, nll64, nll65, nl286, 

nl949, n2426, n2430, n2433) , IMC-26 (e205) , 
dpy-4 (ell66) 
LG V: egl-1 (n986) ; unc-76(e911) 
LG X: dpy-3(e27) 

10 isolation of Additional Alleles of ced-3 

A non-complementation screen was designed to 
isolate new alleles of ced-3. Because animals 
heterozygous for ced-3 (n717) in trans to a deficiency 
are viable (Ellis and Horvitz, Cell 44:817-829 (1986)), 
15 animals carrying a complete loss-of-function ced-3, 
allele generated by mutagenesis were expected to be 
viable in trans to ced-3 (n717) , even if the new allele 
was inviable in homozygotes. Fourteen EMS mutagenized 
egl-1 males were mated with ced-3 (n717) unc-26 (e205) ; 
20 egl-1 (n487) ; dpy-3(e27) hermaphrodites, egl-1 was used 
as a marker in this screen. Dominant mutations in 
egl-1 cause the two hermaphrodite specific neurons, the 
HSNs, to undergo programmed cell death (Trent et al., 
Genetics 104:619-647 (1983)). The HSNs are required 
25 for normal egg-laying, and egl-1 (n986) hermaphrodites, 
which lack HSNs, are egg-laying defective (Trent et 
al., 1983 supra). The mutant phenotype of egl-1 is 
suppressed in a ced-3; egl-1 strain because mutations 
in ced-3 block programmed cell deaths, egl-1 males 
30 were mutagenized with EMS and crossed with ced-3 (n717) , 
unc-26 (e205) ; egl-1 (n487); dpy-3(e27). Most cross 
progeny were egg-laying defective because they were 
heterozygous for ced-3 and homozygous for egl-1. Rare 




egg-laying competent animals were picked as candidates 
for carrying new alleles of ced-3 . Four such animals 
were isolated from about 10,000 Fl cross progeny of 
EMS-mutagenized animals- These new mutations were made 
5 homozygous to confirm that they carried recessive 
mutations of ced-3 . 

Molecular Biolocrv 

Standard techniques of molecular biology were used 
(Maniatis et al., Molecular Cloning: A Laboratory 
10 Manual, Cold Spring Harbor Laboratory, 1983) • 

Two cosmid libraries were used extensively in this 
work: a Sau3AI partial digest genomic library of 7000 
clones in the vector pHC79 and a Sau3AI partial digest 
genomic library of 6000 clones in the vector pJB8 (Ish- 
15 Horowicz and Burke, Nucleic Acids Res. 9:2989 (1981)). 

The "right" end of MMM-C1 was cloned by cutting it 
with Hindlll and self-ligating. The "left" end of 
MMM-C1 was cloned by cutting it with Bgrill or Sail and 
self-ligating. 

20 The "right" end of Jc8 was made by digesting Jc8 

with EcoRI and self-ligating. The "left" end of Jc8 
was made by digesting Jc8 by Sail and self-ligating. 

C. elegans RNA was extracted using guanidine 
isothiocyanate (Kim and Horvitz, Genes & Dev. 4:357-371 

25 (1990)). Poly(A) + RNA was selected from total RNA by a 
poly(dT) column (Maniatis et al., 1983 supra). To 
prepare stage-synchronized animals, worms were 
synchronized at different developmental stages (Meyer 
and Casson, Genetics 106:29-44 (1986)). 

3 0 For DNA sequencing, serial deletions were made 

according to a procedure developed by Henikoff (Gene 
28:351^359 (1984)). DNA sequences were determined 
using' Sequenase and protocols obtained from US 
Biochemicals with minor modifications. 
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The Tel DNA probe for Southern blots was P Ce2001, 
which contains a Bergerac Tel element (Emmons et al., 
Cell 32:55-65 (1983)). Enzymes were purchased from New 
England Biolabs, and radioactive nucleotides were from 
5 Amersham . 

Primer extension procedures followed the protocol 
by Robert E. Kingston (In: Current Protocols in 
Molecular Biology, Ausubel et al. (eds.), Greene 
Publishing Associates and Wiley-Interscience, New York, 
10 p. 4.8.1) with minor modifications. 

Polymerase chain reaction (PCR) was carried out 
using standard protocols supplied by the GeneAmp Kit 
(Perkin Elmer) . The primers used for primer extension 
and PCR are as follows: 



15 Pex2: 5' TCATCGACTTTTAGATGACTAGAGAACATC 3 

(Seq. ID #24) ; ^ 
Pexl: 5' GTTGCACTGCTTTCACGATCTCCCGTCTCT 3' 

(Seq. ID #25) ; 
SLl: 5' GTTTAATTACCCAAGTTTGAG 3 ' (Seq. ID #26) ; 

20 SL2: 5' GGTTTTAAC CAGTTACTCAAG 3' (Seq. ID #27); 

Log5: 5' CCGGTGACATTGGACACTC 3' (Seq. ID #28); and 

OligolO: 5' ACTATTCAACACTTG 3' (Seq. ID #29). 

Cermline Transf or mation 

The procedure for microinjection basically follows 

25 that of A. Fire {EMBO J. 5:2673-2680 (1986)) with 

modifications: Cosmid DNA was twice purified by CsCl- 
gradient. Miniprep DNA was used when deleted cosmids 
were injected. To prepare miniprep DNA, DNA from 1.5 
ml overnight bacterial culture in superbroth (12 g 

30 Bacto-tryptone, 24 g yeast extract, 8 ml 50% glycerol, 
900 ml H 2 0, autoclaved; after autoclaving, 100 ml 0.17 
M KH 2 P0 4 and 0.72 M KH 2 P0 4 were added) was extracted by 
alkaline lysis method as described in Maniatis et al. 




(1983 supra). DNA was treated with RNase A (37°, 30 
minutes) and then with protease X (55°, 30 minutes), 
extracted with phenol and then chloroform, precipitated 
twice (first in 0.3 M sodium acetate and second in 0.1 . 
5 M potassium acetate, pH 7.2), and resuspended in 5 Ml 
injection buffer as described by A. Fire (1986 supra) . 
The DNA concentration for injection is in the range of 
100 ug to 1 mg per ml. 

All transformation experiments used ced-1 (el735) ; 

10 unc-31 (e92B) ced-3 (n717) strain, unc-32 was used as a 
marker for co-transformation (Kim and Horvitz, 1990 
supra) . ced-1 was present to facilitate scoring of the 
ced-3 phenotype. The mutations in ced-1 block the 
engulfment process of cell death, which makes the 

15 corpses of the dead cells persist much longer than in 
wild-type animals (Hedgecock et al., Science 220:1277- 
1280 (1983)). The ced-3 phenotype was scored as the 
number of dead cells present in the head of young LI 
animals. The cosmid C10D8 or the plasmid subclones of 

20 C10D8 were mixed with C14G10 (unc-31 (+) -containing) at 
a ratio of 2:1 or 3:1 to increase the chances that a 
Unc-31 (+) transf ormant would contain the cosmid or 
plasmid being tested as well. Usually, 20-30 animals 
were injected in one experiment. Non-Unc Fl progeny of 

2 5 the injected animal were isolated three to four days 

later. About 1/2 to 1/3 of the non-Unc progeny 
transmitted the non-Unc phenotype to F2 progeny and 
established a transf ormant line. The young LI progeny 
of such non-Unc transformant were checked for the 

3 0 number of dead cells present in the head using Nomarski 

optics . 




RESULTS 



Isolation of Additional ced-3 Alleles 

All of the ced-3 alleles that existed previously 
were isolated in screens designed to detect viable 
5 mutants displaying the Ced phenotype (Ellis and 

Horvitz, Cell 44:817-829 (1986)). Such screens may 
have systematically missed any class of ced-3 mutations 
that is inviable as homozygotes. For this reason, a 
scheme was designed that could isolate recessive lethal 

10 alleles of ced-3. Four new alleles of ced-3 (nl!63, 
nll64, nll65, n!2B6) were isolated in this way. Since 
new alleles were isolated at a frequency of about 1 in 
2500, close to the frequency expected for the 
generation of null mutations by EMS in an average C. 

15 elegans gene (Brenner, Genetics 77:71-94 (1974); 

Greenwald and Horvitz, Genetics 96:147-160 (1980)), and 
all four alleles are homozygous viable, it was 
concluded that the null allele of ced-3 is viable. 

Mapping RFLPs near ced-3 

2 0 Tel is a C. elegans transposable element that is 

thought to be immobile in the common laboratory Bristol 
strain and in the Bergerac strain (Emmons et al . , Cell 
32:55-65 (1983)). In the Bristol strain, there are 30 
copies of Tel, while in the Bergerac strain, there are 

25 more than 400 copies of Tel (Emmons et al., 1983 supra; 
Finney, Ph.D. thesis, Massachusetts Institute of 
Technology, Cambridge, Massachusetts, 1987) . Because 
the size of the C. elegans genome is small (haploid 
genome size 8 x 10 7 bp) (Sulston and Brenner, Genetics 

30 77:95-104 (1976)), a polymorphism due to Tel between 
the Bristol and Bergerac strains would be expected to 
occur about once every 200 )cb. Restriction fragment 




length polymorphisms (RFLPs) can be used as genetic 
markers and mapped in a manner identical to 
conventional mutant phenotypes. A general scheme has 
been designed to map Tel elements that are dimorphic 
5 between the Bristol and Bergerac strains near any gene 
of interest (Ruvkun et al., Genetics, 121:501-516 
(1989)). Once tight linkage of a particular Tel to a 
gene of interest has been established, that Tel can be 
cloned and used to initiate chromosome walking. 

10 A 5,1 kb Bristol-specific Tel EcoRI fragment was 

tentatively identified as containing the Tel closest to 
ced-3. This Tel fragment was cloned using cosmids from 
a set of Tel-containing C. elegans Bristol genomic DNA 
fragments. DNA was prepared from 4 6 such TC1- 

15 containing cosmids, and this DNA was screened using 
Southern blots to identify the cosmids that contain a 
5.1 kb EcoRI Tcl-containing fragment. Two such cosmids 
were identified: MMM-C1 and MMM-C9. The 5.1 kb EcoRI 
fragment was subcloned from MMM-C1 into pUC13 

2 0 (Promega) . Since both ends of Tel contain an EcoRV 

site (Rosenzweig et al . , Nucleic Acids Res. 11:4201- 
4209 (1983)), EcoRV was used to remove Tel from the 5.1 
kb EcoRI fragment, generating a plasmid that contains 
only the unique flanking region of this Tcl-containing 
25 fragment. This plasmid was then used to map the 
specific Tel without the interference of other Tel 
elements. 

unc-30 (e!91) ced-3 (n717) dpy-4 (e!166) /+++ males 
were crossed with Bergerac (EM1002) hermaphrodites, and 

3 0 Unc non-Dpy or Dpy non-Unc recombinants were picked 

from among the F2 progeny. The recombinants were 
allowed to self-fertilize, and strains that were 
homozyfcfdus for either unc-30 (e!91) dpy-4 (Bergerac) or 
unc-30 (Bergerac) dpy-4 (el!66) were isolated. After 
35 identifying the ced genotypes of these recombinant 




strains, DNA was prepared from these strains, A 
Southern blot of DNA from these recombinants was probed 
with the flanking sequence of the 5.1 kb EcoRI Tel 
fragment. This probe detects a 5.1 kb fragment in 
5 Bristol N2 and a 3.4 kb fragment in Bergerac. Five out 
of five unc-3 0 ced-3 dpy(+Berg) recombinants, and one 
of one unc-30 (+Berg) ced-3 dpy-4 recombinants showed 
the Bristol pattern. Nine of ten unc-30 (+Berg) dpy-4 
recombinants showed the Bergerac pattern. Only one 

10 recombinant of unc-30 (+Berg) dpy-4 resulted from a 
cross-over between ced-3 and the 5.1 kb Tel element. 
The genetic distance between ced-3 and dpy-4 is 2 map 
units (mu) . Thus, this Tel element is located 0.1 mu 
on the right side of ced-3. 

15 Cosmids MMM-C1 and MMM-C9 were used to test 

whether any previously mapped genomic DNA cosmids 
overlapped with these two cosmids. A contig of 
overlapping cosmids was identified that extended the 
cloned region near ced-3 in one direction. 

20 To orient MMM-C1 with respect to this contig, both 

ends of MMM-C1 were subcloned and these subclones were 
used to probe the nearest neighboring cosmid C48D1. 
The "right" end of MMM-C1 does not hybridize to C48D1, 
while the "left" end does. Therefore, the "right" end 

2 5 of MMM-C1 extends further away from the contig. To 

extend this contig, the "right" end of MMM-C1 was used 
to probe the filters of two cosmid libraries (Coulson 
et al., Proc. Natl. Acad. Sci . USA 83:7821-7825 
(1986)). One clone, Jc8, was found to extend MMM-C1 in 
30 the opposite direction of the contig. 

RFLPs between the Bergerac and Bristol strains 
were used to orient the contig with respect to the 
genetic map. Bristol (N2) and Bergerac (EM1002) DNA 
was digested with various restriction enzymes and 

3 5 probed with different cosmids to look for RFLPs. Once 



such an RFLP was found, DNA from recombinants of the 
Bristol and Bergerac strains between ced-3 and unc-26, 
and between unc-30 and ced-3 was used to determine the 
position of the RFLP with respect to ced-3. 

The "right" end of Jc8, which represents one end 
of the contig, detects an RFLP (nP33) when N2 and 
EM1002 DNA was digested with JTijidlll. A Southern blot 
of DNA from recombinants between three ced-3 (+Berg) 
unc-26 was probed with the "right" end of Jc8. Three 
of three +Berg unc-26 recombinants showed the Bristol 
pattern, while two of two ced-3 unc-2 6 (+Berg) 
recombinants showed the Bergerac pattern. Thus, nP33 
mapped very close or to the right side of unc-2 6 . 

The "left" end of Jc8 also detects a ffindlll RFLP 
(nP34) • The same Southern blot was reprobed with the 
Jc8 "left" end. Two of the two ced-3 unc-2 6 (+Berg) 
recombinants and two of the three ced-3 (+Berg) unc-26 
recombinants showed the Bergerac pattern. One of the 
three ced-3 (+Berg) unc-2 6 recombinants showed the 
Bristol pattern. The genetic distance between ced-3 
and unc-2 6 is 0.2 mu. Thus, nP34 was mapped between 
ced-3 and unc-2 6, about 0.1 mu on the right side of 
ced-3 . 

The flanking sequence of the 5.1 kb EcoRI Tel 
fragment (named nP35) was used to probe the same set of 
recombinants. Two of three ced-3 (+Berg) unc-2 6 
recombinants and two of two ced-3 unc-2 6 (+Berg) 
recombinants showed the Bristol pattern. Thus, nP35 
was also found to be located between ced-3 and unc-2 6, 
about 0.1 mu on the right side of ced-3. 

A similar analysis using cosmid T10H5 which 
contains the ifindlll RFLP (nP36) f and cosmid B0564, 
which 'contains a tfindlll RFLP (nP37) , showed that nP36 
and nP37 mapped very close or to the right of unc-30. 
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These experiments localized the ced-3 gene to an 
interval of three cosmids. The positions of the RFLPs, 
and of ced-3, unc-3 0 and unc-2 6 on chromosome IV, and 
their relationships to the cosmids are shown in Figure 
5 9. It was has been further demonstrated by 

microinjection that cosmids C37G8 and C33F2 carry the 
unc-30 gene (John Sulston, personnel communication) . 
Thus, the region containing the ced-3 gene was limited 
to an interval of two cosmids. These results are 
10 summarized in Figure 9. 

Complementation of ced-3 by Germline Transformation 

Cosmids that were candidates for containing the 
ced-3 gene were microinjected into a ced-3 mutant to 
see if they rescue the mutant phenotype. The procedure 

15 for microinjection was that of A. Fire (EMBO J. 5:2673- 
2680 (1986)) with modifications. unc-31, a mutant 
defective in locomotion, was used as a marker for - 
cotransf ormation (Kim and Horvitz, Genes & Dev. 4:357- 
371 (1990)), because the phenotype of ced-3 can be 

2 0 examined only by using Nomarski optics. Cosmid C14G10 
(containing unc-32(+)) and a candidate cosmid were 
coinjected into ced-1 (el375) ; unc-31(e928) ced-3 (n717) 
hermaphrodites, and Fl non-Unc transf ormants were 
isolated to see if the non-Unc phenotype could be 

2 5 transmitted and established as a line of transf ormants . 

Young LI progeny of such transf ormants were examined 
for the presence of cell deaths using Nomarski optics 
to see whether the ced-3 phenotype was suppressed. 
Cosmid C14G10 containing unc-31 alone does not rescue 

3 0 ced-3 activity when injected into a ced-3 mutant. 

Table 4 summarizes the results of these transformation 
experiments . 

As shown in Table 4, of the three cosmids injected 
(C43C9, W07H6 and C48D1) , only C48D1 rescued the ced-3 



phenotype (2/2 non-Unc transf ormants rescued the ced-3 
phenotype) . One of the transf ormants, nEX2 , appears to 
be rescued by an extra-chromosomal array of injected 
cosmids (Way and Chalfie, Cell 54:5-16 (1988)), which 
5 is maintained as an unstable duplication, since only 
50% of the progeny of a non-Unc Ced(+) animal are 
non-Unc Ced(+) • Since the non-Unc Ced(+) phenotype of 
the other transformant (nISl) is transmitted to all of 
its progeny, it is presumably an integrated 

10 transformant. LI ced-I animals contain an average of 
23 cell corpses in the head (Table 5). LI ced-1; ced-3 
animals contain an average of 0.3 cell corpses in the 
head,- ced-I; unc-31 ced-3; nISl and ced-1; unc-31 
ced-3; nEX2 animals contain an average of 16,4 and 14.5 

15 cell corpses in the head, respectively. From these 

results, it was concluded that C48D1 contains the ced-3 
gene. 

In order to locate ced-3 more precisely within the 
cosmid C48D!, this cosmid was subcloned and the 

20 subclones were tested for the ability to rescue ced-3 
mutants (Table 5). C4 8D1 DNA was digested with 
restriction enzymes that cut rarely within the cosmid 
and the remaining cosmid was self-ligated to generate a 
subclone. Such subclones were then injected into a 

25 ced-3 mutant to look for complementation; young LI 
non-Unc progeny of the transf ormants were examined 
using Nomarski optics for the presence of cell death in 
the head. When C4 8D1 was digested with BamEI and self- 
ligated, the remaining 14 kb subclone (named C48D1-28) 

3 0 was found to rescue the ced-3 phenotype when injected 
into a ced-3 mutant (Figure 10 and Table 5) . C48D1-28 
was then partially digested with Bglll and self- 
ligated. Clones of various lengths were isolated and 
tested. for their ability to rescue ced-3. 
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One clone, C48D1-43, which did not contain a 1.7 
kb BglU fragment of C48D1-28, was able to rescue ced-3 
(Figure 10 and Table 5). C48D1-43 was further 
subcloned by digesting with BamHI and Apal to isolate a 
5 10 kb BajnHI -Apal fragment. This fragment was subcloned 
into pBSKII+ to generate pJ40. pJ40 can restore ced-3 + 
phenotype when micro injected into a ced-3 mutant. pJ4 0 
was subcloned by deleting a 2 kb Bgrlll-Apal fragment to 
generate pJ107. pJ107 was also able to rescue the 

10 ced-3 phenotype when microinjected into a ced-3 mutant. 
Deletion of 0.5 kb on the left side of pJ107 could be 
made by ExoIII digestion (as in pJ107del28 and 
pJ107del34) without affecting ced-3 activity; in fact, 
one transgenic line, nEX17 , restores full ced-3 

15 activity. However, the ced-3 rescuing ability was 

significantly reduced when 1 kb was deleted on the left 
side of pJ107 (as in pJ107dell2 and pJ107del27) , and 
the ability was completely eliminated when a 1.8 kb 
Sall-Bgrlll fragment was deleted on the right side of 

20 pJ107 (as in pJ55 and pJ56) , suggesting that this Sail 
site is likely to be in the ced-3 coding region. From 
these experiments, ced-3 was localized to a DNA 
fragment of 7.5 kb. These results are summarized in 
Figure 10 and Table 5. 

25 ced-3 Transcript 

pJ107 was used to probe a Northern blot of N2 RNA 
and detected a band of 2.8 kb. Although this 
transcript is present in 12 ced-3 mutant animals, 
subsequent analysis showed that all 12 ced-3 mutant 
3 0 alleles contain mutations in the genomic DNA that codes 
for this mRNA (see below) , thus establishing this RNA 
as a ced-3 transcript. 

The developmental expression pattern of ced-3 was 
determined by hybridizing a Northern blot of RNA from 



animals of different stages (eggs, LI through L4 larvae 
and young adult) with the ced-3 cDNA subclone pJ118. 
Such analysis revealed that the ced-3 transcript is 
most abundant during embryonic development, which is 
5 the period when most programmed cell deaths occur, but 
it was also detected during the LI through L4 larval 
stages and is present in relatively high levels in 
young adults. This result suggests that ced-3 is not 
only expressed in cells undergoing programmed cell 
10 death . 

Since ced-3 and ced-4 are both required for 
programmed cell death in C. elegans , one of the genes 
might act as a regulator of transcription of the other 
gene. To examine if ced-4 regulates the transcription 

15 of ced-3 , RNA was prepared from eggs of ced-4 mutants 
(nll62, 211416, nl894, and n!920) , and a Northern blot 
was probed with the ced-3 cDNA subclone pJ118. The 
presence of RNA in each lane was confirmed with an" 
actin I probe. Such an experiment showed that the 

20 level of ced-3 transcript is normal in ced-4 mutants. 
This indicates that ced-4 is unlikely to be a 
transcriptional regulator of ced-3 . 

Isolation of 'a ced-3 cDNA 

To isolate cDNA of ced-3, pJ4 0 was used as a probe 

2 5 to screen a cDNA library of N2 (Kim and Horvitz, Genes 
£ Dev. 4:357-371 (1990)). Seven cDNA clones were 
isolated. These cDNAs can be divided into two groups: 
one is 3.5 kb and the other 2.5 kb. One cDNA from each 
group was subcloned and analyzed further. pJ8 5 

30 contains the 3 . 5 kb cDNA. Experiments showed that pJ85 
contains a ced-3 cDNA fused to an unrelated cDNA; on 
Northerm blots of N2 RNA, the pJ85 insert hybridizes to 
two RNA transcripts, and on Southern blots of N2 DNA, 
pJ85 hybridizes to more than one band than pJ40 (ced-3 




genomic DNA) does, pJ87 contains the 2.5 kb cDNA. On 
Northern blots , pJ87 hybridizes to a 2.8 kb RNA and on 
Southern blots, it hybridizes only to bands to which 
pJ40 hybridizes. Thus, pJ87 contains only ced-3 cDNA. 
5 To show that pJ87 does contain the ced-3 cDNA, a 

frameshift mutation was made in the Sail site of pJ40 
corresponding to the Sail site in the pJ87 cDNA. 
Constructs containing the frameshift mutation failed to 
rescue the ced-3 phenotype when microinjected into 
10 ced-3 mutant animals, suggesting that ced-3 activity 
has been eliminated, 

ced-3 Sequence 

The DNA sequence of pJ87 was determined (see 
Figure 4; Seq. ID #18). pJ87 contains an insert of 2.5 

15 kb which has an open reading frame of 503 amino acids 
(Figure 4; Seq. ID #19). The 5' end of the cDNA 
contains 25 bp of poly-A/T sequence, which is probably 
an artifact of cloning and is not present in the 
genomic sequence. The cDNA ends with a poly-A 

20 sequence, suggesting that it contains the complete 3' 
end of the transcript. 1 kb of pJ87 insert is 
untranslated 3' region and not all of it is essential 
for ced-3 expression, since genomic constructs with 
deletions of 380 bp of the 3' end can still rescue 

25 ced-3 mutants (pJ107 and its derivatives, see Figure 
10) . 

To confirm the DNA sequence obtained from the 
ced-3 cDNA and to study the structure of the ced-3 
gene, the genomic sequence of the ced-3 gene in the 
30 plasmid pJ107 was determined (Figure 4; Seq. ID #18). 
Comparison of the ced-3 genomic and cDNA sequences 
revealed that the ced-3 gene has seven introns that 
range in size from 54 bp to 1195 bp (Figure 5A) . The 
four largest introns, as well as sequences 5' of the 
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start codon, were found to contain repetitive elements. 



which have been previously characterized in non-coding 
regions of other C. elegans genes such as fem-1 (Spence 
et al., Cell 50:981-990 (1990)), lin-12 (J. Yochem, 
personal communication), and myoD (Krause et al. , Cell 
63:907-919 (1990)) (Figure 4). Of these, repeat 1 was 
also found in fem-1 and myoD, repeat 3 in lin-12 and 
fem-1, repeat 4 in lin-12 , and repeats 2 and 5 were 
novel repetitive elements . 

A combination of primer extension and PCR 
amplification was used to determine the location and 
nature of the 5' end of the ced-3 transcript. Two 
primers (Pexl and Pex2) were used for the primer 
extension reaction. The Pexl reaction yielded two 
major bands, whereas the Pex2 reaction gave one band. 
The Pex2 band corresponded in size to the smaller band 
from the Pexl reaction, and agreed in length with a 
possible transcript that is trans-spliced to a C. 
elegans splice leader (Bektesh, Genes & Dev., 2:1277- 
1283 (1988)) at a consensus splice acceptor at position 
2166 of the genomic sequence (Figure 4). The nature of 
the larger Pexl band is unclear. 

To confirm the existence of this trans-spliced 
message in wild-type worms, total C. elegans RNA was 
PCR amplified using the SLl-Log5 and SL2-Log5 primer 
pairs, followed by a reamplif ication using the SL1- 
OligolO and SL2-01igolO primer pairs. The SL1 reaction 
yielded a fragment of the predicted length. The 
identity of this fragment was confirmed by sequencing. 
Thus, at least some, if not most, of the ced-3 
transcript is trans-spliced to SL1 . Based on this 
result^ 1 the start codon of the ced-3 message was 
assigned to the methionine encoded at position 2232 of 
the genomic sequence (Figure 4). 



Five types of repetitive elements were found, some of 
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The DNA sequences of 12 EMS-induced ced-3 alleles 
were also determined (Figure 4 and Table 3). Nine of 
the 12 are missense mutations. Two of the 12 are 
nonsense mutations, which might prematurely terminate 
5 the translation of ced-3 . These nonsense ced-3 mutants 
confirmed that the ced-3 gene is not essential for 
viability. One of the 12 mutations is an alteration of 
a conserved splicing acceptor G, and another has a 
change of a 70% conserved C at the splice site, which 

10 could also generate a stop codon even if the splicing 
is correct. Interestingly, these EMS-induced mutations 
are in either the N-terminal quarter or C-terminal half 
of the protein. In fact, 9 of the 12 mutations occur 
within the region of ced-3 that encodes the last 100 

15 amino acids of the protein. Mutations are notably 

absent from the middle part of the ced-3 gene (Figure 



significantly hydrophobic region can be found that 
might be a trans-membrane domain (Figure 6) . The Ced-3 
protein is rich in serine. From amino acid 78 to amino 
acid 205 of the Ced-3 protein, 34 out of 127 amino 

25 acids are serine. Serine is often the target of 

serine/ threonine protein kinases (Edelman, Ann . Rev. 
Biochem. 56:567-613 (1987)). For example, protein 
kinase C can phosphorylate serines when they are 
flanked on their amino and carboxyl sides by basic 

30 residues (Edelman, 1987 supra) . Four of the serines in 
the Ced-3 protein are flanked by arginines (Figure 4) . 
The same serine residues might also be the target of 
related Ser/Thr kinases. 



35 the Ced-3 protein, genomic DNAs containing the ced-3 



5) . 



20 



Ced-3 Protein Contains A Region Rich in Serines 

The Ced-3 protein is very hydrophilic and no 



To identify the functionally important regions of 




genes from two related nematode species, C* briggsae 
and C. vulgaris were cloned and sequenced (Figure 7; 
Seq. ID #20 and 21) . Sequence comparison of the three 
ced-3 genes showed that the non-serine-rich region of 
the proteins is highly conserved. In C. briggsae and 
C. vulgaris , many amino acids in the serine-rich region 
are dissimilar compared to the C. elegans Ced-3 protein 
(Figure 7) . It seems that what is important in the 
serine-rich region is the overall serine-rich feature 
rather than the exact amino acid sequence. 

This hypothesis is also supported by analysis of 
ced-3 mutations in C. elegans: none of the 12 EMS- 
induced mutations is in the serine-rich region, 
suggesting that mutations in this region might not 
affect the function of the Ced-3 protein and thus, 
could not be isolated in the screen for ced-3 mutants. 
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Table 1 
Rescue of the Ced-4 
Phenotype by Qermline Transformation 



Genotype 



No. 

DNA Avg. No. Cell Animals 

Injected Corpses (LI Head) Scored 



ced-1/ ced-4; 
unc-31; nExl 



C10D8; 
C14G10 



9.4 



10 



ced-1/ ced-4; 
unc-31; nEx7 



C10D8-5 
C14G10 



11-5 



10 



ced-1/ ced-4 
unc-31; nEx8 



C10D8-5 
C14G10 



11.5 



10 



ced-1 



None 



23 



20 



ced-1; ced-4 



None 



0.6 



20 
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Table 4 

Summary of Transformation Experiments 
Using Cosmida in the ced-3 Region 



Coamid 
injected 

C43C9; C14G10 

W07H6; C14G10 



C48D1; C14G10 



No. of non-Unc 
tranaf onnants 

1 

3 



Ced-3 
phenotvpe 



+ 
+ 



Strain name 

MT4302 

MT4299 
MT4300 
MT4301 

MT4298 
MT4303 



Animals injected were of genotype: ced-1 (el735) ; unc-31(e929) 
ced-3 (n717) . 



Genotype 
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Table S 

The expression of ced-3 (+) transf ormants 



DNA injected 



Average No. 
cell deaths 
in LI head 



No. 

Animals 
■cored 



ced-1 



23 



20 



ced-1 ; ced-3 



0.3 



10 



ced-1; nisi 
unc-31 ced-3 



C48D1; 
C14G10 



16.4 



20 



ced-1; unc-31 
ced-3; nISl/+ 



14 . 5 



20 



ced-1; unc-31 
ced-3; nEX2 



C48D1; 
C14G10 



13.2 

0 



10/14 
4/14 



ced-1; unc-31 
ced-3; nEXlO 



C48D1-28 ; 
C14G10 



12 



9/10 



1 of 10 



ced-1; unc-31 
ced-3; nEX9 



C48D1-28; 
C14G10 



12 



10 



ced-1; unc-31 
ced-3; nEXll 



C48D1-43 
C14G10 



16.7 



10/13 



Abnormal cell 3/13 
deaths 



ced-1; unc-31 
ced-3; nEX13 



pJ4 0; C14G10 



13 .75 



4/4 



-65- 



Table 5 continued 



ced-1; unc-31 
ced-3; HEX17 



ced-1; xinc-31 
ced-3; n£X18 



ced-1; unc-31 
ced-3; nEX19 



ced-1; unc-31 
ced-3; nEX16 



pJ107del28, 

pJ107del34 

C14G10 



pJ107del28, 
pJ107dell34 
C14G10 



pJ107del28, 

pJ107del34 

G14G10 



pJ107dell2, 

pJ107del27 

C14G10 



23 



12.8 



10. 6 



7.8 



12/14 

2/14 
9/10 

1/10 
5/6 

1/6 
12/12 



Alleles of the genes used are ced-1 (el735) , unc-31 (e928) , and ced- 
3 (n717) . 




Equivalents 

Those skilled in the art will recognize, or be 
able to ascertain using no more than routine 
experimentation, many equivalents to the specific 
5 embodiments of the invention described herein. Such 
equivalents are intended to be encompassed by the 
following claims. For example, functional equivalents 
of DNAs and RNAs may be nucleic acid sequences which, 
through the degeneracy of the genetic code, encode the 

10 same proteins as those specifically claimed. 

Functional equivalents of proteins may be substituted 
or modified amino acid sequences, wherein the 
substitution or modification does not change the 
activity or function of the protein. A "silent" amino 

15 acid substitution, such that a chemically similar amino 
acid (e.g., an acidic amino acid with another acidic 
amino acid) is substituted, is an example of how a^ 
functional equivalent of a protein can be produced. 
Functional equivalents of nucleic acids or proteins can 

20 also be produced by deletion of nonessential sequences. 



